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Description 



BACKGROUND OF THE INVENTION 
Field of the Invention 

5 [0001] The present invention relates to a system and method for controlling broadcast of and/or customer access 
to data such as video programs in accordance with objective profile data indicative of the customer's preferences 
for that data. More particularly, a preferred embodiment of the invention relates to a system and method for 
determining from objective profile data of the customers which data or video programming is most desired by each 
customer so that the customers may receive data or video programming customized to their objective preferences. The 
objective profile data is updated on a continuing basis to reflect each customer's changing preferences so that the 

10 content of the data channels or video programming may be updated accordingly. 

Description of the Prior Art 

[0002] The so-called "Information Super Highway" is expected to bring wondrous technological changes to 

15 society. Data of all kinds will become readily available to the public in quantities never before imaginable. Recent 
breakthroughs in video compression technologies are expected to extend the "Information Super Highway" right into 
the video realm by allowing customers to receive literally hundreds of video channels in their homes. While the 
prospects of opening a whole new world of information to the average person are exciting, there is much concern that 
the average person will simply be overwhelmed by the quantity of data piped into their homes. Sone techniques must 
be developed which permit the travelers of the Jnformation Super Highway to navigate through the plethora of 

20 available information sources without getting hopelessly lost. 

[0003] For example, in the home video context, it is desired to provide mechanisms which present the available 
video information to the customers in a comprehensible way. Such mechanisms should eliminate the necessity of 
"channel surfing" to find a program suitable for viewing out of the hundreds of video programming alternatives which 
are expected to be made available. The present invention is thus designed to help the customer of video and other 
data services to receive, with minimal effort, the information he or she is most interested in. 

25 [0004] Numerous systems are available which assist customers in determining which video programs to watch. For 
example, electronic program guides and the like are available which give customers on-screen access to the upcoming 
programming whereby the desired programming may be selected in advance for later recording. An early system 
described in U.S. Patent No. 4,170,782 to Miller allows the viewer to preselect a television viewing schedule of 
desired television channels to be viewed during subsequent time periods. Miller also monitors the television 

30 programs actually watched by the television viewer and relays this information to a central data processing center 
over a communication link. Subsequer interactive cable systems, such as that described by Freean in U.S. Patent No. 
4,264,924, permit the viewer to select the information to be received on particular channels. The cable system 
described by Freeman also provides individually tailored messages to the individual viewers. Similarly, Young 
disclosed in U.S. Patent No. 4,706,121 a system which permits the viewer to select programs from schedule 
information by controlling a programmable television tuner to provide the broadcast signals for the selected 

35 programs to the television receiver at the time of broadcast. This system can also be used to control a VCR for 
unattended recording of the selected programs. Further details of such a VCR recording system is provided by Young 
in U.S. Patent Nos. 4,977,455 and 5,151,789. Other systems, such as that described by Reiter et al. in U.S. Patens 
No. 4,751,578, provide updatabie television programming information via telephone link, magnetic cards, floppy 
disks, television or radio subcarrier, and the like, to the viewer's television screen in such a manner that tie 

4Q viewer may selectively search this information. 

[0005] Unfortunately, in each of the aforementioned prior art systems, the customer must actively select the 
desired programming. In other words, these systems facilitate access to programming designated by the customer but 
provide no assistance to the customer in determining what programming to select for subsequent viewing. With the 
possibility of several hundred video channels soon becoming available io video customers, additional systems are 
desirable which assist the customer in selecting the desired programming. 

45 [0006] The system described by Herz et al. in U.S. Patent No. 5,351,075 partially addresses the above problems, 
at least with respect to the provision of movies over cable television. As described therein, members of a "Home 
Video Club" select the video programs they would like to see in the following week. A scheduling computer receives 
the members' inputs for the current week and determines the schedule for the following week based upon the tabulated 
preferences. This schedule is then made available to the members of the Home Video Club. If, when, and how often a 
particular video program is transmitted is determined by the customer preferences received by the scheduling 

:nJ computer. Prime time viewing periods are used to make certain that the most popular video programs are broadcast 
frequently and at the most desirable times. As with the aforementioned systems, the "Home Video Club" system does 
not automatically broadcast the most desired video programs to the customers but instead requires the active 
participation of the customers to "vote" for the most desired video programs for subsequent viewing . 
[0007] It is desired to extend a customer preference system such as the "Home Video Club" to include general 

55 cable programming offerings and to minimize active customer involvement in the determination of the desired 
programming. Unlike the movie scheduling system described in the "Home Video Club" application, the number and 



content of general cable programming channels is scheduled in advance and typically cannot be chanqed bv the 
customer through a simple voting system. As a result, the customer can only vary his or her video programming by 
changing channels. In other words, the customer typically illustrates his or her programming preferences by changing 
channels Indeed, such changes are monitored by Nielsen, Arbitron, and other ratings agencies in setting the rates 
for advertising. In U.S. Patent No. 5,155,591, one of the present inventors carried this concept a stej 9 furthe? bv 
obtaining information about the customers and then denographically targeting television commercials to the cistomers 
most likely to respond favorably to such adverting. Unfortunately, however, this demographic and cistomer 
preference information has not been specifically described for providing customized channels which better reflec 
the customers' preferences for the programming itself. Ieci 
[0008] The present inventors have found that the afore-mentioned problems may be overcome by creatine 
customized programming channels from all of the programming available at any time and broadcasting the customized 
programming channels to groups of customers. The customer's set top multimedia terminal then creates "virtual 
channels as a collection of the recerved programming data from one or more of the customized programming channels 
at any pom in t.me for receipt on the customer's television. These virtual channels are received as an additional 
offering to the regular broadcast transmission and are customized to the customer's preferences Thus as used 
herein, a virtual channel is a channel formed as a composite of several source materials or programs which may or 
may not change during respective time periods to reflect the programming most desirable to the customer during that 
time period The creation of such "virtual channels" is intended to minimize the amount of "channel surf"? 
necessary to find the most preferred video program at a particular time 

KSSLh P rr reVi ° US a " empts ,fL pr ° v i din ,9 such selective ac cess to programming have required active customer 
nfS h t f am K P ' e K : n , U ' S - PatSnt N °- 4 ' 381 - 522 - Lambert disclosed a system in which the customer is 

permitted to specify which television signal source is to be connected to the video switch for broadcasting of a 

?o 6 fhfr Sfn V ^L 0 r n S r0g h ram t tO i he , CUStOmer ' Th , e desired pr ° grarn is Selected from * P^ram schedule channel provided 
to the customer Hashimoto discloses more elaborate systems in U.S. Patent Nos. 4,745,549 and 5 075 771 in which 

STn, 3 " if t '° 't dMdUal f customer ' s ,astes are elected from all of the available television ' programs in 
accordance with the customer preferences specified on a customer questionnaire or provided from the customer over a 
telephone link or the like. The viewer preference data provided using the questionnaires, the telephone lines and 
™™ L wh n h C K y P roce f se l b y. linea ' programming to provide an individual subscriber television 
program hst which may be used by the video provider to select which programs to broadcast to particular 
indMduals.. Subscriber complaints about the program list are used to "tune" the television program list to better 
match the individuals tastes. An automatic controller is also used to automatically control a television or video 
cassette recorder in accordance with the subscriber's specified tastes. However, the system disclosed by Hashimoto 
works from Med objectrve data provided by the customer in response to a questionnaire and provides no ^chanfem 
for validating the accuracy of the profile of that customer other than through the use of a complaint system h 
addition the system disclosed by Hashimoto does not determine the desirabfiity of particular video programs but merely 
allows the customer to charactenze those types of programs to which he or she may be most interested 
k°i£L hSLm 8 reaSOnS h no ' ed fove, feedback regarding the customer programming and purchasing preferences 
is highly desirable. It >s highly desirable to develop a technique for better acquiring and quantifying such 
ul pXnfwn^^^r 9 and P urch a sin 9 Preferences. Along these iines, Sirubbe recently described a syttem in 
U.S Patent No. 5,223.924 which provides an interface for automatically correlating the customer preferences with 
he television program .nformation and then creating and displaying a personated customer program database from 
the results of the correlation In the Strubbe system, the customer specifies whether he or she "likes" a pSaT 

™H r °nr m Ti! .1 13 accordin 9 lv - Then ' from 'he video programs "liked" by the customer a 

second, personalized, database is created. However, as with each of the systems described above, the Strubbe system 
^ evel °P cus ^ m er Profiles and automatically update the database of "liked" videos using feedback/Also 
m2£» tTiu le f h , lahe Preference information maybe used to predict what new video programs the custome 
may like and then schedule those new video programs for viewing. 

[0011] Those in the technical press have fantasized about so-called "smart" televisions which will keep track 
of past viewing preferences and suggest new programs that match the customer's personal tastes so that her 
customer need not channel surf" through the 500 channel video system of the future. However, prior to the present 
r e nlT; ?° 0ne f kn0Wr : t0 ,ha Feaent inventors has been able to make such "smart" televisions a reality, indeed 
the present invention is believed to be the first system to create "virtual channels" of recommended programming for 
each customer of a video or other data service. Fiuyiainming lor 

[0012] WO-A-9414284 describes a programmable set-top terminal for a television program delivery system which 
suggests programs. The terminals can be reprogrammed and programs can be selected and displayed to suggest to 
subscribers for viewing The terminal memory can be reprogrammed remotely and data can be gathered and analysed 
for selecting programs to suggest to a subscriber. y 
[°°ll J • , EP -A-0572090 describes a system comprising a user interface for accessing downloaded television 
program information which can be continually updated and which can automatically correlate this information with the 

the oorrelafon ^ ^ 3 ' ' e3St 006 inf ° rmati ° n database based on u P on ^ re" ' 

SUMMARY OF THE INVENTION 



[0014] In accordance with a first aspect of the present invention there is provided a method of scheduling 
customer access to data of a plurality of data objects as defined in claim 1 . 

[0015] In accordance with a second aspect of the present invention there is provided a data transmission system 
which schedules customer access to data of a plurality of data objects as defined in claim 11. 

[0016] The present invention thus relates to a system and method for making available the video programming and 
other data most desired by the customer by developing an "agreement matrix" characterizing the attractiveness of 
each available source of video programming or data to each customer. From the agreement matrix, one or more Virtual 
channels" of data, customized to each customer, are determined. At any given time, the one or more virtual channels 
include the video programming or other data which is predicted to be most desirable to the customer based on the 
customers preferences. The virtual channels are determined by selecting from the available alternatives only those 
video programs or other data which most closely match the customer's objective preferences. 

[0017] A method of scheduling customer access to data from a plurality of data sources is provided Although 
the technique of the invention may be applied to match customer profiles for such disparate uses as computed 
text retrieval, music and music video selection, home shopping selections, infomercials, and the like in the 
presently preferred embodiment, the method of the invention is used for scheduling customer access to video programs 
and other broadcast data. In accordance with the preferred method, objective customer preference profiles are 
obtained and compared with content profiles of the available video programming. The initial customer profiles are 
determined from customer questionnaires, customer demographics, relevance feedback techniques, default profiles 
and the like, while the initial content profiles are determined from questionnaires completed by "experts" or some 
sort of customer's panel, are generated from the text of the video programs themselves, and/or are determined by 
adopting the average of the profiles of those customers who actually watch the video program Based on the 
comparison results, one or more customized programming channels are created for transmission and from those 
channels, each customer's set top multimedia terminal may further determine "virtual channels" containing a 
collection of only those video programs having content profiles which best match the customer's profile and hence 
are most desirable to the customer during the relevant time frame. 

[0018] Preferably, one or more customer profiles are created for each customer of the video programs These 
customer profiles indicate the customer's preferences for predetermined characteristics of the video programs and 
may vary in accordance with time of day, time of the week, and/or customer mood. Such "characteristics" may include 
any descriptive feature suitable in describing particular video programs, such as classification category directors- 
actors and actresses; degree of sex and/or violence; and the like. Corresponding content profiles are created for 
each video program available for viewing and generally indicate the degree of content of the predetermined 
characteristics in each video program. An agreement matrix relating the customer profiles with the content profiles 
is then generated. Preferably, the agreement matrix enables the system to determine a subset of the available 
programs at a particular point in time which is most desirable for viewing by the customer. The determined subset of 
video programs is then presented to the customer for selection in the conventional manner, except that each "virtual 
channel includes a collection of the offerings available on all of the originally broadcast channels from the cable 
system. The virtual channels" are then generated by the customer's set top multimedia terminal for display on the 
customers television. The customer may then select the desired video programming, which mayor may not include the 
programming offered on the "virtual channels." Similar techniques are used at the video head end to determine which 
video programs to transmit to each node for use in the creation of the "virtual channels" at each customer's set top 
multimedia terminal. K 

[0019] Preferably, the customer profile creating step comprises the step of creating a plurality of customer 
profiles for each customer, where the plurality of customer profiles are representative of the customer's changing 
preferences for the predetermined characteristics in accordance with time of the day and of the week In such an 
embodiment, the agreement matrix determining step comprises the step of using different customer profiles for each 
customer in accordance with the time of the day and of the week, thereby reflecting changes in the customer's 
preferences or moods" during the course of the week. In addition, the customer profile creating step preferably 
comprises the step of clustering customer profiles for combinations of customers expected to view the video programs 
at a particular customer location at particular times on particular days. For example, the clustered profiles for a 
customers residence may contain the combined profiles of Mom and Dad in the evening and the combined profiles of 
the children in the afternoon. In this embodiment, the agreement matrix determining step comprises the step of using 
the different clustered customer profiles in accordance with the time of the day and of the week. Alternatively the 
appropriate customer profiles for use in calculating the agreement matrix may be determined directly from identity 
• information received from the customer or assigned to the customer in accordance to the cluster of customers to 
which that customer belongs. In the latter technique, it will be appreciated that customer profiles are not strictly 
necessary since each customer is assigned an initial customer profile determined from the clustered profiles of the 
other customers in his or her cluster of customers. 

[0020] In the presently preferred embodiment of the invention, the agreement matrix determining step comprises 
the step of comparing the customer profiles with the content profiles for each video program available for viewing 
in a predetermined time period, in particular, the agreement matrix determining step preferably comprises the st°p 
of determining a distance in multidimensional characteristic space between a customer profile and a content profile 
by calculating an agreement scalar for common characteristics, ac, between the customer profile, cv, and the content 
profiles, cp, in accordance with the relationship; 



acuity + Z k w lk \cv ik -c PiJ \], 

for i = a particular customer of a number of customers I, j = a particular video program of a number of video 
programs J, and k = a particular video program characteristic of a number of video program characteristics K 
where W, k is customer i's weight of characteristic k. As will be appreciated by those skilled in the art, an 
agreement matrix so defined is the reciprocal of the distance d (= l/ac) in multi-dimensional space between the 
customer profile vector and the content profile vector and that many different distance measurement techniques may 
be used in determining the distance d. In such an embodiment, the subset determining step preferably comprises the 
steps of sorting the video programs in an order of ac indicating increasing correlation and selecting as the subset 
a predetermined number of the video programs having the values for ac indicating the most correlation. 
[0021] When scheduling video programs at a head end using the techniques of the invention, the agreement matrix 
is preferably determined from customer profiles of a plurality of customers and the video programming is scheduled 
using the steps of: 

(a) determining a video program j which most closely matches the customer profiles of the plurality of customers 
of the video programs; 

(b) scheduling the video program j for receipt by the plurality of customers and decrementing a number of 
channels available for transmission of video programs to said customers; 

(c) when the number of channels available for transmission of video programs to a particular customer of the 
plurality of customers reaches zero, removing the particular customer from the plurality of customers for 
scheduling purposes; and 

(d) repeating steps (a)-(c) until the number of video programs scheduled for transmission equals the number of 
channels available for transmission of video programs. 



[0022] In accordance with a currently preferred embodiment of the invention, a passive feedback technique is 
provided for updating the customer profiles in accordance with the video programming actually watched by the 
customer Such a method in accordance with the invention preferably comprises the steps of: 

creating at least one customer profile for each customer of the video programs, the customer profile indicatinq 
the customer's preferences for predetermined characteristics of the video programs; 

creating content profiles for each video program available for viewing, the content profiles indicating the 
degree of content of the predetermined characteristics in each video program; 

monitoring which video programs are actually watched by each customer; and 

updating each customer profile in accordance with the content profiles of the video programs actually watched by 
that customer to update each customer's actual preferences for the predetermined characteristics. 



[0023] Preferably, the monitoring function is accomplished by storing, at each customer's set top multimedia 
terminal, a record of the video programs actually watched by the customer at the customer's location and in the 
case of a system with a two-way communication path to the head end, polling the set top multimedia terminals of all 
customers to retrieve the records of the video programs actually watched by the customers at each customer location 
Also, from the retrieved records, combined customer profiles may be determined which reflect the customer profiles 
of a plurality of customers. Then, by determining the agreement matrix using the combined customer profiles for each 
node, programming channels containing the video programming which are collectively most desired by the customers 
making up the combined customer profiles may be determined for transmission from the head end to each of the 
customers connected to the same node. 

[0024] When a predicted video program is not selected by the customer, it is desirable to update the agreement 
matrix to better reflect the customer's tastes. The updating of the agreement matrix may be accomplished in a 
variety of ways. For example, the customer profile, cv ik , for customer i and video program characteristic k may be 
adjusted to a new customer profile, cv ik ', in accordance with the equation: 

c V = cv ik-A(cv ik -c Pjk ), 

where cp jk represents the degree of video program characteristic k in video program j and a is a small constant which 
can vary in accordance with the desired accuracy for the profiles. On the other hand, customer i's weighting of 
video program characteristic k, w jk , in the customer profile, cv lk may be adjusted to a new weighting, W ' in 
accordance with the equation: 



w *' = K : A|CV [k - cp Jk |) / Z k (W jk - A|CV lk - cp jk |). 



[0025] In addition, the content profiles, cp jk , of certain video programs j having video program characteristics k 
may be adjusted to new content profiles, cp,*', to update the customer profiles of customers i who actually watch 
video program j, in accordance with the equation: 

c Pik' = c P jk -A(cv |k -cp ), 

wnere cv lk represents the customer profile of customer i for video program characteristic k. Of course other 
updating techniques are also possible within the scope of the invention. 

[0026] Since the data passing from the set top multimedia terminal to the head end contains data which the 
customers may consider to be confidential, the two-way transmission system of the invention may be modified to 
encrypt the transmissions from the set top multimedia terminals to the head end. Similarly as in the case of pav- 
per-view programming, it is often desirable to encrypt the transmissions from the head end to the set top multimedia 
terminals. In accordance with the invention, a secure transmission system from the head end to the set too 
multimedia terminal is obtained by performing the steps of: " 

(1) At the set top multimedia terminal, generate a seed random number N to be used for the random number 
generator. 

(2) Retrieve the public key P from the head end and encrypt the seed random number N as E(N P) at the set too 
multimedia terminal using a public key algorithm such as RSA which is known to be difficult to break. 

(3) Send the encrypted seed N (E(N,P)) to the head end where E(N.P) is received and decrypted to yield N usinq 
the head end s private key Q. a 

(4) The head end and set top multimedia terminals then initialize their respective pseudo-random number 
generators with N as a seed. 

(5) Begin the encryption at the head end by generating the first number in the sequence K, and logically exclusive- 
ORmg it with the first data word in the stream P„ thereby forming C, (i.e., C, = EOR(K„ P,)). 

(6) Send the result C, from the encryptor at the head end to the set top. 

(7) Form K, at the synchronized random number generator of the set top multimedia terminal, which has also been 
initialized with seed N, by decrypting the received C, to yield P,. This is done by exclusive-ORing K, with C to 
yield P, (i.e., P, - EOR (K,, C,)), generating the next pseudo-random K, in the sequence at the head end and'the 
set top multimedia terminal, determining whether all words i in the sequence have been decrypted and repeatina 
steps (5)-(o) until all words in the digital video stream have been decrypted. Normal processing of the digital 
video stream continues from that point. Secure transmission from the set top multimedia terminal to the head end 
is obtained in the same manner by reversing the set top multimedia terminal and the head end in steps (1 y(7) above 



[0027] Those skilled in the art will appreciate that the techniques described herein are applicable to numerous 
other areas of technology in which it is desirable to assist the customer in the selection of a data service which 
best meets that customer's needs. For example, the agreement matrb< may be used at a head end to minimize the 
amount of data, such as electronic program guide data, which is to be stored at the user's terminal thereby 
min.miz.ng memory requirements of the user's terminal. As another example, the agreement matrix of the invention may 
be used to facilitate text retrieval in a computer database system and may be implemented in a kiosk or personal 
computer designed to assist in the selection of movies, music, books, and the like. All such embodiments will become 
apparent to those skilled in the art from the following description of the preferred embodiments. 

BRIEF DESC RIPTION OF THE DRAWINGS 

[0028] The above and other objects and advantages of the invention will become more apparent and more readily 
appreciated from the following detailed description of the presently preferred exemplary embodiments of the 
invention taken in conjunction with the accompanying drawings, of which: 

Figure 1 is a flow chart illustrating the flow of processing of the customer and content profile data in 
accordance with a preferred embodiment of the invention. 

Figure 2 is a flow chart illustrating the method of selecting the contents of channels which are to b- 
transmitted from a CATV head end to a plurality of customers in accordance with the invention. 



Figure 3 is a flow chart illustrating the method of passively updating customer profiles in accordance with the 



invention. 



Figure 4 is a generalized system overview of a one-way customer profile system in -accordance with the invention 
in which customized virtual channels are created at the set top multimedia terminals from the channels rec-ived 
from the CATV head end. 

Figure 5 is a generalized system overview of a two-way customer profile system which expands upon the 
embodiment of Figure 4 by feeding back data representative of the customers' viewing habits from the customers' 
set top multimedia terminals to the CATV head end for purposes of optimally scheduling the channels for 
transmission from the head end in accordance with the recorded customer preferences. 

F '?u Ure u 6 u° Ck d ' a9 ? m ° f 3 Cable television distribution system, including an optional two-way return 
path, which has been modified to transmit video programs determined from the fed back customer profile data in 
accordance with the techniques of the invention. 

Figure 7 is a flow diagram of an upstream encryption technique for encrypting data sent from the set to D 
multimedia terminal to the head end in accordance with the techniques of the invention. 

Figure 8 is a flow diagram of a downstream encryption technique for encrypting data sent from the head end to 
the set top multimedia terminal in accordance with the techniques of the invention. 

Figure 9 is a block diagram of the software used in the set top multimedia terminals in a preferred embodiment 
of the invention. 

Figure 10 is a block diagram of a preferred hardware implementation of a set top multimedia terminal in 
accordance with the invention. 

Figure 11 is a simplified block diagram of a computer kiosk or personal computer which uses the profile and 
clustering techniques of the invention to assist a customer in the selection of videos for rental music or 
books for purchase, and the like. 

DETAILED DESCRIPTIO N OF TH E PRES E NTLY PREFERRE D EMBODIMFNTS 

[0029] The present invention will be described in detail below with respect to Figures 1-11 Those skilled in 
the art will appreciate that the description given herein is for explanatory purposes only and is not intended to 
hmit the scope of the invention. For example, while the invention is described with respect to a cable television 
broadcasting system those skilled in the art will appreciate that the system described herein may also be used for 
m ^ n 5 reC ff P d fl es,re ddata se^ices and shop at home services, and for selecting from available music and 
multimedia offerings. Accordingly, the scope of the invention is only to be limited by the scope of the appended claims. 

I. OVERVIEW 

• [0030] The present invention relates to a customer profile system in which the characteristics of a data source 
are quantified in some objective manner and stored as content profiles and the customer's preferences for 
those characteristics are stored in the form of one or more customer profiles. In the following detailed 
description the present inventors will describe how the techniques of the invention are used for creating content 
profiles which characterize the data sources in accordance with their degree of content of predetermined 
characteristics Techniques will also be described for creating, weighting, and updating customer profiles which 
reflect the customer s afrmity for those predetermined characteristics. From the content profiles and the customer 
praxes, an agreement matrix will be described which matches the customers' preferences and the contents of the 
data sources available at any point in time. As will be described in detail below, the agreement matrix is used at 
the customers set top multimedia terminal to create "virtual" data channels from the available data sources or 
alterna lively the agreement matrix may be used by the data provider to determine which data sources of those 
available will have the most appeal to his or her customers. 

[0031] A preferred embodiment of the invention will be described in the context of a one-way CATV transmission 
system and a two-way transmission CATV transmission system with feedback for adjusting the agreement matrix 
thwTbe d 0 escribed Umer0US a " ematiVe embodiments for application of the techniques of the invention will 

II. Content Profiles and Customer Profiles 

[0032] . As noted above, a preferred embodiment of the invention will be described for application in a CATV 
distribution system for aiding a customer in the selection of video programming for viewing by matching the 



available video programming to each customer's objective preferences. Accordingly, the content and customer profiles 
will include characteristics which are useful in defining the characteristics of video programming. Of course, when 
the present invention is used to assist in the selection of data from other data sources, the content and customer 
profiles will include completely different characteristics. 

[0033] In accordance with the preferred embodiment of the invention, the content profiles describe the contents 
of video, programs and are compared mathematically in a computer to customer profiles to generate an agreement 
matrix which establishes the degree of correlation between the preferences of the customer or customers and the 
video programming available during each video programming time slot. The content profiles and the customer profiles 
are thus described as a collection of mathematical values representing the weighted significance of several 
predetermined characteristics of the video programming. For ease of description, the present inventors will describe 
the mathematical basis for the content profiles and the customer profiles in this section and will describe the 
generation of the agreement matrix and the uses of the agreement matrix in the next section. 

A. Terminology 

[0034] The following subscription indices will be used throughout this specification: 
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customers 


(i = 1 ,2, . 


. i); 


j 


programs 


0 = 12... 
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characteristics 


(k=1,2,. 
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categories 


(1 = 1,2,. 
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and the following variables will be used throughout this specification: 
cv ik : customer i's rating for characteristic k; 

CVj : the vector {CV ik |k e K} which forms customer i's profile for all characteristics k; 

sv ik : spread (flexibility) in viewer i's rating for characteristic k; 

wv lk : customer i's weight of characteristic k; 

cp jk : objective weighting of program j for characteristic k; 

CPj : the vector {cp 3k |k e K} which forms program j's profile for all characteristics k; 

sp jk : spread (flexibility) in program j's rating for characteristic k; and 

ac y : agreement scalar representing similarity between CV, and CPj 

[0035] It should be noted at the outset that cv ik indicates customer i's preferred level for characteristic k, 
while cp jk indicates the level of presence of a characteristic in the program. sv jk and sp jk , on the other hand, 
respectively represent customer i's flexibility in accepting different levels of characteristic k and the 
flexibility in the determination of the degree of content of characteristic k in program j. 

[0036] cv and cp may have values between 0 and +10, where the actual range number indicates the relevance of 
that characteristic. In other words, a video programming having a value of +10 for a given characteristic has the 
highest degree of content for that- variable. The values of cv and cp should always be non-negative, since both are 
related to the level of characteristics, the former being the desired level and the latter being the actual level. 
Naturally, zero in cv means the customer's rejection of a characteristic, while zero in cp indicates the absence of a 
characteristic in a program. Those skilled in the art may wish to allow cv to become negative so that the magnitude of 
negativity could indicate the level of the customer's aversion to a characteristic. However, there are drawbacks in 
using negative values in this manner. First, a negative value will blur the meaning of cv and sv and weaken the 
statistical basis for their calculation. Second, cv has been defined as the desired level of a characteristic, which 
should be non-negative. Thus, the level of aversion is preferably expressed as a point value, instead of a range of 
values. 

[0037] Preferably, the level of aversion is expressed by the combination of a zero-value in cv and a certain 
value in the corresponding wv. For example, when customer i totally rejects characteristic k, cv^ can be set to -1, 
which means prohibition. Any program k for which cv ; * < 0 will be excluded from the recommendation list for 
customer i. Cf course, as in the Strubbe system, the values could simply be "0" or "1" to indicate the presence or 
absence of a characteristic. 

[0038] wv tt illustrates the importance of characteristic k to customer i. Typically, different characteristics 
bear different levels of importance for a customer, and the introduction of this variable catches the variation. 
Although any scaling system may be used, the weight variable, wv, may simply weight the associated characteristic 
on a scale of 0-5, where 5 indicates the highest affinity for the associated characteristic. On the other hand, as 
in the Strubbe system, the weights could simply indicate a "like" or "dislike" value for each characteristic. 
[0039] Finally, as will be described in more detail below, the agreement scalar for characteristics, ac, is the 
weighted average of the values of a variant of the two-sample t test for significance between CV and CP. 



B. Creating Initial Customer and Content Profiles 



[0040] A profile, either of a customer (Customer Profile) or of a program (Content Profile), is composed of 
arrays of characteristics which define the customer profile vector CV, and the program profile vector CP, To increase 
the accuracy in statistical estimation, the selection of the characteristics should follow the following guidelines: 

o The characteristics should be descriptive of the features of the programs; 

o The list should be fairly inclusive, i.e., include all the common features of the programs; and 

o There should be no synonyms, nor much overlapping in meaning between two or more characteristics In other 
words, the correlations between the characteristics are desirably minimized. 

[0041] For example, characteristics currently in use for characterizing video programming include film genres 
such as westerns, comedies, dramas, foreign language, etc. as defined by the American Film Institute and/or as 
provided via existing television database services; directors; actors/actresses; attributes such as amount and deqree 
of sex, violence, and profanity; MPAA rating; country of origin; and the like. Of course many other 
characteristics may be used, such as those characteristics used in the Minnesota Psychological Test' or other more 
particularized categories based on life experiences and emotions. Such characteristics may also be value based by 
indicating the scientific, socio-political, cultural, imagination evoking, or psychological content as well as the 
maturity level to which the program appeals. 

[0042] In accordance with the invention, there are several ways to develop the initial customer and content 
profiles for such characteristics. For example, the initial customer profile may be assigned on the basis of the 
customer's zip code or other characteristic demographic information. In other words, the profile may b« set to a 
profile typical of the customer's zip code area or to a typical profile determined by interviews or empirically bv 
monitoring what customers watch. Similarly, each customer may be assigned a generic customer profile which is 
personalized over time through the profile adjustment techniques to be described below. Alternatively a customer 
may be asked to name several of his or her favorite movies and television shows so that an initial customer profile 
may be determined by combining or averaging the content profiles of the selected movies and television shows In 
addition, each customer may complete a ballot for each viewing mood. This latter approach builds upon the technioue 
described by Hashimoto in U.S. Patent Nos. 4,745,549 and 5,075,771 and will be described in some detail here 
[0043] For explanation purposes, it will be assumed that the initial customer profile is determined from an 
initial customer questionnaire or ballot. When completing the initial questionnaire, the customer may choose betwe-n 
two voting schemes, one (Scheme A) by characteristics, and the other (Scheme B) by categories Scheme A Is 
straightforward. In Scheme A, the customer gives acceptable ranges for all the characteristics which identify a 
video program. The customer's profile { cv | cv € CV } is immediatelv obtained by simply calculate th» m*ans -f 
these ranges. In Scheme B, however, the customer gives a specific rating for each of the categories" If ccv 7 is the 
rating for i customer i for category 1, with a scale of 10 in which zero means least satisfaction with the cateqory 
and 10 means the greatest satisfaction, the customer's profile { cv //f | cv ik c cv,.} may be calculated as: 

cv jk = 1/N L , * Z |t cc kj , (1* € L') Equation (1) 
where U is the set of categories in which ccv, = max, ccv fff and N L . is the cardinality of L\ In other words a 
customer's rating of a characteristic equals the objective content rating of that characteristic in the customer's 
most preferred category. If there are multiple most-preferred categories, indicated as ties in ccv, the objective content 
ratings can be used. 

[0044] Alternatively, the customer may be required to input an upper limit cvu* and a lower limit cvk for 
characteristic k to show his/her acceptable range for characteristic k, where cv,* is calculated as: 

cv ix = ( c ^ ik + cvi |k ) / 2 Equation (2) 
i.e the middle point of the range. If customer i wants to indicate his/her indifference to characteristic k ie 
that he/she can accept any level of characteristic k, then he/she may either let cvu jk = 1 0 and cv\ ik = 0, or let wv- =0.' " 
[0045] The initial value for cp,-* is calculated as the mean of all votes on that characteristic by "experts" or 
other viewers used to characterize the video programs. As will be noted below, initial profiles for video programs 
may be obtained by using a panel of experts or customers to assign content profiles or by assigning the customer 
profiles to the video programs on the basis of those who liked the video program during a test screening On the 
other hand, the initial value for sv ik is calculated as: 

sv lk = (cvu ik - cvl. k ) / Z, Equation (3) 
where Z preferably equals 2. The calculation simulates that for standard deviation, where the cuttinq point for 
rejection in normal distribution is usually when the divisor Z = 2. Accordingly, cv is of point value and sv \s of ranqe 
value. Thus, while the value of the divisor Z in Equation (3) may be altered to tighten/loosen the cutting point 



the divisor in Equation (2) may not be changed. sp t on the other hand, may be calculated as the standard deviation 
in the experts' or test group's votes on cp. 

[0046] The major advantage of Scheme B is that much burden will be taken away from the customer in the ballot 
completion process, since as a rule, the number of characteristics well exceed the number of categories. The 
disadvantage, however, is the inaccuracies due to the fact that not all the characteristics in the customer's 
favorite category are on the customer's most preferred level. The inaccuracy may be reduced by expanding the most- 
preferred categories to those with ccv values in a certain upper percentile, rather than with the maximum ccv values. 
The value of sv can be derived from the deviation of ccv's in the customer's most-preferred categories. To help the 
customers vote more consistently, each category is preferably accompanied by a list of keywords (i.e., 
characteristics that are relevant to the category). 

[0047] Similarly, the content profile may be determined using questionnaires completed by a panel of experts or 
customers who determine the content of all video programming available for broadcast. The same scaling systems 

10 would be used as for the customer profiles. For statistical purposes, it is desired that the expert or customer 
panel be as large as possible. As will be noted below, once the system of the invention is in operation, the 
customer profiles of those who actually watch a particular video program during an initial screening by a sample 
group may be used to assign a content profile to that program for subsequent viewings. In such a system, those 
customers who watch the program from beginning to end will be presumed to have "liked" the program. The customer 
profiles of those who "liked" the program would then be combined to create the initial profile for that program. 

15 That program would then be ready for broadcast to all members of the network. 

[0048] Alternatively, in the presently preferred embodiment, more sophisticated techniques are used to generate 
the initial content profiles. In the preferred embodiment, the content profile of a program is determined 
automatically from the word frequency of certain words in the text or on-line description of a program or the 
frequency of certain words in the closed captions of a television show, where such words are chosen as 

20 representative of certain categories. Of course, other simpler techniques such as one which simply determines the 
presence or absence of particular characteristics may be used within the scope of the invention. 

[0049] The weighting of the characteristics in the customer and content profiles somewhat depends on how the 
profiles were determined initially. For example, the weighting of the customer profiles may be obtained directly 
from a questionnaire by asking the customer to appropriately scale (from 1-10) his or her preference for each 
characteristic. On the other hand, if the customer profile is assigned based on demographics, zip code, and the 
25 like, the average weights for other customers with the same demographics, zip code, and the like may be used. When 
more statistical techniques are used for creating the initial customer and content profiles, the weights may be 
determined mathematically as, for example, the reciprocal of the standard deviation of the characteristics. Of 
course, other weighting techniques may also be used. 

3Q C. Adjusting Customer Profiles 

[0050] As noted above, the data from which the initial customer profile is derived may be obtained through 
ballot filling, whereby a number of characteristics are listed and the customer gives his/her preference rating (cv) and 
flexibility range (sv) for each characteristic. However, people often do not provide all of the necessary responses or 
the correct responses to such ballots or questionnaires. Similarly, when the initial customer profiles are assigned 
35 to new customers on the basis of demographics and the like, there is a substantial likelihood that the initial 
customer profile will need considerable adjustment. Moreover, the system should account for the fact that many 
people's tastes change over time. Thus, to ensure accuracy of the profiles, there must be some way to correct errors 
in the initial customer profiles and to adjust the customer profiles over time. 

[0051] In accordance with the invention, a passive feedback technique is provided whereby the programming 
4Q viewed by the customers are automatically monitored and used to adjust the customer profiles. That technique will be 
described in more detail in Section V below. This section will instead refer to an active feedback mechanism which 
will be referred to as a "rave review." 

[0052] . As noted above, one way to establish an initial customer profile is to show an unrated program or 
portion of a program to a target audience and to assign to the unrated program a combination of the customer 
profiles of those who actually watched the program or portion of the program from beginning to end or to assign 
45 ratings inputted by those who completed a survey. A similar technique may be used for error correction or for 
creating initial customer profiles. In particular, the customer is exposed to a series of short sections of 
different video programs. .Each section is characterized by a few characteristics, and the assigned characteristic 
level of each of the characteristics is presented to the customer. The customer is then asked to state his/her most 
preferred level for the characteristic given the assigned characteristic level for the viewed section of the video 
program. For instance, if the level of "action" in a section of the movie "First Blood" is assigned a value of 8, 
the customer may give 4-6 as his/her acceptance range. On the other hand, if the customer strongly disagrees with 
the assigned characteristic level, he/she may provide his/her own estimation of the level of the characteristic in 
the presented program section and give his/her acceptance accordingly. Of course, the major advantage of such a 
"rave review" procedure over ballot completion is that instead of voting on an abstract concept, the customer now 
makes estimations based on concrete examples. 
55 [0053] Since the customer may not be able to remember exactly his/her preferred level of a characteristic that 
he/she indicated in a previous review or in an original ballot or may not have known his or her assigned initial 



50 



value for a characteristic, he/she may intentionally or involuntarily repeat the same level for the same 
character,^ ,n one review tf that characteristic appears in more than one program section, even jflS 
characteristic should not have he same level. Therefore, in a "rave review" a program section with a lame variS 
of characteristics should be selected to avoid repetitions of the same characteristic across several video dbT In 
addition, the customer should be advised that while making level estimations he/she should concentrate on the 
features of the current program sections and forget his/her previous ratings. 

[0054] As mentioned above, changes in a customer profile should be expected over time. In other words the 
values of cv and sv obtained from the rave reviews typically will be different from the values specified in the 
original ballot and during previous rave reviews. In order to avoid dramatic changes in the values the settina of 
the new values should take into consideration both the old and the new data. Thus, if x represents either cv or sv 
x " is the value of x after period n (n "rave review" changes), and y" is the value obtained during time n, then: 

x" = y" whenn=1 Equation (4) 

and 

x" = x"-' + (1/n) (y-» - x"-') when n > 1 . Equation (5) 

Each y is obtained either from the customer's ballot or rave review, but typically y' is from the ballot data and v' 1>1 
is from the rave review data. From the above equations, it follows that: ' 

x n = (Vn) I, y 1 , Equation (6) 
i.e., the new value of x after the n iterations is equal to the average of all the n-1 previous data. The method is 
formally termed MSA, or Method of Successive Average. With this method, data at each iteration is equally accounted 
for m the final value. Any dramatic changes will be damped down, especially at later iterations (when n l large? 
This approach agrees with intuition, since a customer's profile should stabilize over time 9 '' 

[0055] However, customers may have systematic bias in estimating their preferences. For instance a customer 
may constantly underestimate or overestimate his/her preference rating for a characteristic. One way to detect the 
inaccuracy rs ; to check if there are some customers who, when viewing various programs in rave reviews constantly 

Sf if W £ the „ C °?K en Pr ? f " e Va ' U ,t f ° r 3 particU ' ar cnaracte ^o and suggest disproportionate hig^SS 
ratings If frequently the t values m the t significance test turn out to be insignificant despite their agreement 

^k^-ssk to adjust the cus,omers ' ratings of the characteris,ic in ^ in tne = 

[0056] Another way to make adjustments to the customers' combined ratings is through the clustering of 
customers. Customers are asked to give overall ratings for various programs. If a group of customers come up with 
very similar ratings for most of the programs in a category, it is assumed that the actual acceptance Tanges for 
these customers for each characteristic relevant to the category forms a narrow distribution, i.e their values are 
close, to each other. However, if in the distribution of stated ratings, some outside values wh eh are fa away from 
the majority are seen, then the indication is that these outside values need to be adjusted 

[0057] There are many algorithms to find outsiders in a population. For instance, all the values may be sorted 
in the ascending order of their absolute distances from the mean, and gaps searched for at the lower end Those 
values that are located below the largest gap would be outsiders. For statistical validity, the mean and standaS 
2°"°' * -Population less the outsiders may be calculated and a t significance test conducted to deSnftf 
any of the outsiders belong to the population. Only those that do not belong will be subject to adjustment. m ' ne * 

D. Adjusting Content Profiles 

[0058] As discussed above, in a rave review, the customer may state his/her disagreement with the rating of a 
characteristic in a video program and put forward his/her own rating for each characteristic in the program This 
provides a mechanism for adjustment of the content profiles. program, i ms 

S22L J" S T?r ^T nt inVenti ° n may USe th8 ratingS of ex P erts or test qrou P s as the reference base- 
Generally, the calculation of the agreement scalar ac is based on the values of cp and so. Since the vaE 

%«2lTJr,^ Calc ( ula ' ing ac for a " ««t°mer5. any inaccuracy in their values will affect the final 

resuKs for a customers. (By contrast, an inaccuracy in the value of a customer's cv and sv only affects the results 
for that customer.) By definition, customers collectively make ratings relatively closer to reality than any experts 
or test groups. In other words, the customer's rating is reality. For instance, if all the customers on the ^ average ?Sd 
StinnlfoS h 3 P art,cular /f characteri f c (for one or ail programs), then the experts' or test groups' objective 
ratings for that characteristic (for one or all programs) should be raised to agree with the customers' perceptions 
[0060] Again, MSA may be used for the content profile adjustment; letting x be either cp or sp and letting 
y n be the value collectively suggested in period n by the customers for the variable, where y" is defined as: 

y" = (1/1) E, Y", Equation (7) 

where y", is the value suggested by customer i during period n. By substituting y in Equation (5) into Equation (4), thex 
value may be calculated. For customers who do not state disagreement, their y" may be set to x n , ie the 
original content profile. Therefore, y" is the average of the customers' suggested value at period n ' The 



resultant x n+f is the adjusted content profile after period n. 

[0061] This method would be -less useful if only the content profiles of a characteristic for individual 
programs may be adjusted. Often, the relative bias is' systematic, i.e., as seen from the customer's side, the 
characteristic values may underestimate or overestimate the significance of a characteristic for programs of certain 
or all categories. This problem can also be addressed as follows. 

[0062] For clarity in discussion, subscripts again will be used. y Jk is the customers' average suggested rating for 
characteristic k for program j. The distribution of the customers' ratings is assumed to be normal. For simplicity 
time subscripts are dropped. A t value significance test is then conducted as: 

Equation (8) 

(i=1,2,...,/;/=1,2,...,J;/c=1,2 K) 

where: 



S Yjk-cPjk - -\j sv2/ P-V + sp 2 / (M - 1 ) , Equation (9) 

and where: 

t jk is the t value for significance of difference between the customers' suggested rating of characteristic k for 
program j and the corresponding assigned objective rating; 

Syjk-cpjk is the standard deviation between the distribution of y jk and that of cp jk ; 

I is the total number of customers; and 

M is the total number of "experts." 



[0063] If t jk is significant for a pre-defined level (say 0.05) with degree of freedom of I + M - 2, then cp jk is 
determined to be significantly different from y ;7f . In that case, an adjustment in cp jk is necessary, and MSA is calculated 
to obtain the new cp jk from y jk . 

[0064] With the above method, only the assigned objective rating of a characteristic for individual programs is 
adjusted, in order to adjust the assigned objective ratings of a characteristic for all the programs in a category, 
the following is used: 

tj-ik Equation (10) 

where: 

T lk is the average of the t values for characteristic k in all the programs for category I; 

t H k is the t value for significance of the difference between the customers' suggested rating of characteristic 
k for program j, and the corresponding objective rating; and 

J, is the number of programs in category I. 



[0065] If an adjustment of the assigned objective rating of a characteristic over several or all categories is 
desired, the t values of an even wider range are averaged. For instance, if it is necessary to make an adjustment 
for all categories, calculate: 

T k = (1 /L^Z^S,^ t Hk , Equation (11). 

where: 

T k is the average of the t values for characteristic k in all programs. 



[0066] When a content profile value cp is changed to cp\ it is also necessary to change the 

corresponding sp (deviation in cp) to sp\ Because there is a distribution in cp, there must be some expert(s) whose 
rating(s) is below or above the mean (cp). If through the above calculation cp overestimates/underestimates reality, 
it is assumed that only those experts whose ratings are above/below the mean made an overestimation/underestimatioa 
Therefore, after the adjustment, the new deviation sp* should be smaller thansp. 
[0067] One possible calculation of sp* is: 



sp' = sp/( 1 + a*|cp'-cp|/cp) Equation (12) 
Since cp > 0, sp' < sp (i.e., sp always declines after adjustment). If a = 1 , cp = 3, cp' = 4, and sp = 1 , thensp* = 0.75. 
The parameter a thus determines the rate of decreasing in sp with the rate of change in cp. 

[0068] It should be pointed out that before making actual changes to content profiles determined by experts or 
test groups, it is desirable to consult with the experts or test groups for the proposed changes. That will not only 
preclude any unreasonable changes, but also will reduce possible future bias by the experts or test groups. 



E. Customer Moods and Time Windows 



[0069] Few people are purely single minded, especially when enjoying entertainment. Besides a generic 
propensity, it is therefore reasonable to assume that each customer could have one or more viewing moods, and in 
each of the moods he/she would like to watch a particular set of program categories. For normal and not highly 
10 capricious people, the moods should be time-specific, i.e., each mood has a. time window, within which the mood is 
effective. 

[0070] On the other hand, people are not free all the time. The time when they can enjoy entertainment is 
limited. The time window concept can be used to represent this temporal limitation as well. Thus, each time window 
can be expressed as a pair of time variables, 1 and u, where 1 is' the starting point of the window and u is the 
15 ending point of the window. Customer profiles used in accordance with the preferred embodiment of video scheduling 
preferably incorporate this concept of moods and time windows. 

[0071] In the present invention, each customer preferably has a generic mood and may also have some specific 
moods. Both generic moods and specific moods may or may not be time-specific. In fact, a non time-specific mood can 
also have a time window, only with I = u, i.e., its window covers the whole day. Typically, for a particular 
customer, the time window for his/her generic mood will have the greatest width, and the width of the windows for 

20 his/her specific moods will decrease with an increase in specification. In this sense, all the moods of a customer 
form a tree, in which the generic mood is the root, and a specific mood becomes the child of another mood if the 
former's window is contained in the latter's window. For example, if a customer has four moods: generic, peaceful 
violent, speculative, the generic mood may cover all times, the violent mood may cover 6 a.m. to noon, the peaceful 
mood 6 p.m. to midnight, and the speculative mood from 8 p.m. to midnight. Thus, the speculative mood is a child 
(subset) of the peaceful mood. The mood at the lowest (most specific) level of the hierarchy is generally used to 

^ develop the program list for the customer (described below). 

[0072] The definition of moods can be the responsibility of the customer. When ballots are used to create the 
initial customer profiles, each ballot may correspond to a mood. In other words, a mood may be equivalent to a 
customer profile. The generic mood or generic customer profile is required unless there is an automatic system 
default mood or profile. Beyond that, the customer can fill out as many ballots as he/she likes to establish 

20 specific moods. 

[0073] A satisfaction factor, sf, is attached to each mood. For the generic mood, sf = 1, which is the 
base, sf increases as the time window narrows since it is reasonable to believe that people get greater 
satisfaction as their more specific requirements are met. sf is either determined by the customer or takes a default 
value. For instance, the system may set a maximum value on sf for the most specific window, which is two hours wide 
and do a linear interpolation to find the sf values for windows of greater widths. If the customer provides the sf values 
for his various mood windows, the values will be normalized in light of the base value of one and the above- 
mentioned system-set maximum value. 

[0074] With the introduction of time windows, each customer i (or customer-mood i, to be more accurate) will 
take on time window superscripts as i'-* while each program j will become j u ' uj , where I, is the starting time of 
program j and u ; is the ending time of program j. The calculation of the agreement scalar ac then proceeds as will be 
40 described in the next section. However, the calculation of as, the final objective value, becomes: 

as.. = sf. * [wc } * ac.. - wf * f(l. f u jt \ p u.)], Equation (13) 
where f(l„ u j( l jt Uj) gives a punishment value expressing the customer's dissatisfaction due to the mismatch between 
the time window of customer-mood i and the broadcast time of. program j, sf,- is the normalized satisfaction factor of 
45 customer-mood i, wc ; is the weight for the existing agreement scalar, and wf is the weight for f, which needs to be 
determined through practice. 

[0075] The major issue here is the form of the punishment function f. Intuitively, f = 0 when the mood window 
contains the broadcast window, i.e., I, < l ; , and u, > u ; , and f increases as the two windows move away from each other. 
Since u,- - l,> u ; - l y , i.e., the mood window is not narrower than the broadcast window, the time discrepancy d between 
50 the two windows may be expressed as': 

d = max( 0, l t - l jt u j - u. ), Equation (14) 

Sof = f(d). 

[0076] It is reasonable to expect that the customer's dissatisfaction increases rather sharply when the 
mismatch of the time windows first begins, which means he/she will miss some part of the program. But when the time 
55 mismatch increases further, the customer's discontent will level off. For example, the customer will feel quite 
upset if he/she misses the beginning ten minutes of a program which he/she likes. However, if he/she has already 
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where « is the maximum dissatisfaction thaTa cu tomer ou.d haS o misL n 

determines how sharply f(d) increases with d. The g^X^^JTZ J"*^ P " ' param8ter Which 
seen through Equation (15) that f(d) = 0 when d = 0 andfTd - a ! 1 P ' th * stee P er the ™™ would be. It can be 

1 _ ".andf(d)-awhend = ro . Thus, the punishment function becomes 
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III. Calculation of Agreement Matrix 



of the various characteristics used to define the Droarams ttS" i™ £ ? ncorporates the desired amounts 
amounts of these characteristics present in each ^gTa^ de^ffi^ V° ^ CUSt ° mer ' and ,he 

I customers, J programs, K characteristics, and M experts then Tach r»n n tl % ^ gr ° Ups - Assumin 9 there » 
scalar for cv* and cp ;(f ) may be calculated as: P h Ce " " the ln,tlal agreement matrix (agreement 

ac -j = ,/ 0 + (wos k (wv lk w,)y 



where: 



(i= 1, 2, ... I, J = 1, 2, ... J), (17) 



t ljk = = Mcv lk - C p 3k )| / Scvik . c? . k , if CVik >m 0 
n - t */ if cv, v = -i 

(X - 1,2,.., I; J = 1,2, ..,J; k , 1,2*., K) 



and: 



and: 



Equation (18) 

V »fc = ^/^W Equation (1 9) 



r U : r^rr' SC t r " Pr ° fileS ° f CUstomer j and th ^ of program r 

in ° f d ^ enCe bet — the of characUic ft customer i and that 

Scvi k -cp jk : standard deviation between the distribution of cv ik and that of cp ■ 
w ' * 2 k wv ik , i.e., the sum of all weights for customer i; ' 

M: • number of "experts" who rate program j- and 

number of times of consideration before the customer reaches a final decision on the rating for cv . 



[0078] 

from 



J] The magnitude of each of the t values shows the rWiatinn ^ th 

that of the video program gr.en the distributes of the raS Z lo^Z ^ ° f 2 characteri * 

group. If the t value is significant at a predetermined levS of*fn£ customer and the experts or test 

2M - 2, then the two distributions could be regarded as betono n X S *' e " 9 " °!° 5 f ° f degree of freedom of 
values can serve as an indicator of the divergence distance in chJr^T P ° PUlati ° a ThS 3Vera 9 e of a " the ' 
customer and that of the program. Therefore, the variable ac whfc h h^^f 806 betWSen the P rofile ° f the 
t values (reciprocal of the distance), exhibits the ^^^2 h the aVerage of lhe 

reaches its maximum value of 1 (perfect agreement) o^Z^^.?^ 1 ^ ^ % ^ and 

ino-d^b'etwee^^ 

characteristics and the customeMgnoS those char acterist^ ^ : « ^"ces only on certain 

SrT 90 ^^ 9 a " lhS Pr ° 9rams in ' hs 

Kmber of ^?JSSff * £ iSS^TS^^ M "? N ,« «» ^ — While M is 
customer reaches his/her final decision on iTZg o?TZ v^ma'v beT? ° f co w nsiderafon before the 
experiments. Generally speaking, the higher N's value thJ \nZr I t ^, 9 delermined empirically through 
various characteristics on average. Preferably, N - M 1 S o7h a SnEJ S*- CUSt ° mer ' S aCCS P tance ' or 

y 1 so mat d 'spersions in cv, which is interpreted as the 



customers acceptance range and in cp, which represents the difference in the experts 1 voting are equallv 
counted Th.s calculation ,s underpinned by the assumption that both the distributions of the experts' Vote and the 
£2dfa? "° GaUSSian) ' Alth0LJ 9 h the assu mpfor. is not guaranteed, that is the best that can be 

L 008 ") ,« ° nce ' ne ini 'l al cu stomer profiles and initial content profiles have been established, a simpler form of 
Equation (1 7) may be used by combining wv and s into a single measure of importance: 



where: 



w ik = wv ik /(l/J)£. s .. k , 



V J sv*JM+ spy/W , Equation (20) 

where sv* is the spread in customer fe rating for characteristic k (inversely correlated with the importance of 
k to i), and sp ;> is the spread in the experts' ratings for characteristic k. Thus: 

ac„ " 11 V + Z„ \cv ik - cp Jk \), Equation (22) 
This simpler notation is preferred and will be used throughout the discussion below. However the alqorithms 
described below are a easily extended to the more complex model of Equation (17). Generally, the more comSom 
of the agreement mat™ (Equation (17)) is only preferred when the customers are asked questions to bufld hSr ' 
customer profiles. The simpler form of the agreement matrix (Equation (21)) is preferred when the prof es are 

SfpresenTinlnfoT 9 ^ ^ ■ U * B ^ 35 h the preSSntly preferred ^plenfenSn of 

[0081] For purposes of illustration, a calculation of a simple agreement matrix will be described here 

Lreasfollow" ' S aSSUmSd that there are ° nly customers: (1) John and (2) Mary. Their sample customer profiles 



characteristic (cv): 


romance 


high-tech 


violence 


1 John 


3.0 


9.0 


7.0 


2 Mary 


9.0 


3.0 


0.0 


standard deviation (sv): 








1 John 


1.0 


2.0 


1.0 


2 Mary 


1.0 


0.5 


0.0 


weight (wv): 








1 John 


2.0 


9.0 


5.0 


2 Mary 


8.0 


3.0 


7.0 



[0083] The available programs are as follows: 





Program Titles 


1 


Star Trek 


2 


Damnation Alley 


3 


Forever Young 


4 


Terminator II 


5 


Aliens 


6 


Fatal Attraction 



The sample content profiles are as follows: 



10 



15 



\ char 
program\ 

1 Star Trek 

Damnation Alley 
Forever Young 
Terminator II 
Aliens 



(cp) ; romance tiigh-tech violence 



2 
3 
4 
5 
6 



2 
5 

8 . 
0, 

o. 



0 
0 
0 
0 
0 



Fatal Attraction 7.0 



\ standard deviation (sp) 
prograrn\ 

1 Star Trek 0.5 

2 Damnation Alley 1.0 

3 Forever Young " 1 . o 

4 Terminator II o.O 

5 Aliens 0.0 

6 Fatal Attraction 2.0 



9.0 
0.0 
3.0 
10.0 
8.0 
0.0 



1.0 
0.0 
0.5 
1.0 
1.0 
0.0 



4.0 
1.0 
0.0 
8 . 0 
9.0 
8.0 



1.0 
1.0 
0.0 
1.5 
1.0 
1.0 



[0084] After normalizing w using: 



20 



25 



w i k = (A Scv ik - cp jf /J) Equation (22) 
where s CVJk . CPJk is defined in Equation (19), the above input data produces the following weight matrix (w): 



\ characteristic: 
customer \ 

1 John 

2 Mary 



romance high-tech violence 



.1S6 
.292 



.425 
. 192 



.409 
.516 



C 



30 



35 



[0085] Given the weight matrix and the characteristic profiles of the customers and the programs, the agreement 
matrix may be calculated. For example, the agreement scalar between customer 1 and program 2 is: 

ac 12 = 1/(1 + w n |cv ir cp 21 |+ w 12 |cv 12 -cp 22 | + wjcv^-cp^) 

= 1/(1 + .166*|3-5| + .425*|9-0| + .409*|7-1|) 

= .131 



40 



45 



[0086] The final agreement matrix (AC) is thus: 



\ Program 
Customer \ 

1 John 

2 Mary 



.418 
160 



307 



. 138 
774 



429 
110 



.365 
. 108 



.170 
.159 



50 



55 



[0087] From the agreement matrix, it is evident that John prefers "Star Trek", Terminator II" and "Aliens- 
while Mary prefers "Forever Young" and "Damnation Alley". This is the results that would have been expected from 
the profiles, only here the preferences have been quantified. 

[0088] Of course, in the simple case where merely the presence or absence of particular characteristics are 
measured, the agreement matrix would look for identity in the most categories rather than the distance between the 
customer profile vector and the content profile vector using the techniques described above. 

IV. Scheduling Video Delivery in Accordance with Customer and Content Profiles 

[0089] The introduction of time dimensions makes possible the scheduling of video programs, i.e., the 



assignment of programs to days and to time slots in accordance with each customer profile. 



A. Scheduling Constraints 



[0090] Solving the problem of assigning days and time slots simultaneously is often impractical because of the 
exponential increasing order in the number of possible combinations. Therefore, the two' tasks are preferably 
performed separately through heuristic methods. 

[0091] The first task, assigning programs to days, is simplified significantly by the fact that in the present 
system the customers' preferences are differentiated mostly by time slots (hours), rather than by days. When the 
customer defines weekday mood time windows, it is assumed that the window will apply to any weekday. Of course, the 
customer may define some weekend time windows, which apply to either Saturday or Sunday. Therefore, for a given set 
of programs available for a week, the major question is not on which day to broadcast them, but during which hours 
to broadcast them. 

[0092] A possible approach to scheduling is that for each program its top n most- preferred broadcast windows 
are determined from the average of the objective values as calculated using Equation (13). The scheduler then uses 
some methods to find a solution in which the average objective value reaches a reasonably high value, and in which 
the time slots are covered. There are many such methods available, such as integer linear programming. 
[0093] . An extra complexity is the possible necessity of repeating some programs during a day or a week, because 
of their high popularity. A simple approach is to let the scheduler first determine the number of necessary 
repetitions, and then add the number in as constraints in the programming. However, it should be noted that the 
above approach is for just one channel. If there are multiple channels, Then it is usually necessary to first 
categorize the channels, find their respective "target audience", and then run the scheduling procedure on the 
target audience of each channel. 

[0094] Attention should also be paid to the mutual exclusion among the overlapping time windows of a customer. 
Although the customer may define time windows which conflict with each other, in terms of overlapping and 
containment, only one of the windows in the. conflicting set can be used in the final assignment. This condition 
should be added to the constraints. 



B. Scheduling Algorithm 

[0095] With the above scheduling constraints in mind, the present inventors have developed an algorithm which 
uses customer profiles and content profiles for scheduling the broadcast of movies and other shows over a video 
distribution network which allows the simultaneous distribution of many channels from a head end to the set top 
multimedia terminals associated with many customers' television sets. The same approach is then used to develop 
"virtual channels" at the set top multimedia terminals based on domain or genre or tastes of individuals so that the 
customer can view the video programming predicted to be most desirable to that customer. The "virtual channels" may 
be displayed on dedicated channels, or the recommended programming may be highlighted directly on the electronic 
program guide or displayed on the customer's screen as recommended programming selections. Also, the channels 
may be reprioritized for presentation on the electronic program guide on the basis of the calculated Virtual 
channels." Similarly, video programming of a particular type, even if not part of the "virtual channels" may be 
highlighted on the electronic program guide as desired. The algorithm for determining the recommended programming 
is based on the above-mentioned "agreement matrix" which characterizes the attractiveness of each movie and video 
program to each prospective customer. In short, a broadcast schedule and/or virtual channel is generated which is 
designed to produce the greatest total customer satisfaction. The generation of the agreement matrix and the 
scheduling of programs in accordance with the generated agreement matrix wili now be described in more detail. 
[0096] As described above, the agreement matrix may be produced by comparing the customer profiles and the 
content profiles. In the following description, it is assumed that the agreement matrix is normalized so that all 
agreements between customers and movies lie between zero and one. 

[0097] The basic problem of scheduling a cable television broadcast can be formulated as follows. 

[009S] Given an agreement matrix A where ac» is the agreement scalar between customer i and program j, find: 

max 2^- Ej ac ij 
J £ N i 



Equation (23) 

where J is a set of programs to be broadcast drawn from a set Q of candidate programs available for broadcast the 
first summation is over all customers i, and the second summation (of j) is over the n, programs in the set of 
programs K, that customer i would most desire to watch. In other words, given the agreement matrix between 
customers and programs, it is desired to pick the set of programs which maximizes the agreement between customers 
and those programs which the customers might watch. For example, if a hundred programs are being broadcast and a 
given customer would not consider more than five of them, it does not matter how much the customer likes the other 



ninety-five programs. However, as noted above, the actual problem can be much more complex since different 

Sir ToT d8Pend ° n 'r ° f day and SinCS mul,i P' e time slots cannot brsS.SIJdeAnden^ 
nrnhlim i« triSil Y Z™ 9 ™™ ^ t0 , be br ° adCaS ' USing the method of the invention - »e above optimization 
problem .s tnvally solved by summing each column of A (calculating Z , ac,) and picking the program j which gives the 

SorT^st^sT^uZ^ Se,ededl h ° WeVer ' ' iS TOt P ° SSlb,e t0 ^ a " ^ — ^ 
[0100] The following algorithm is an example of a greedy algorithm which provides an efficient algorithm for 
approximately solving the above scheduling problem including the fact that it is desired to select a digrams for each 
customer . (the Viewing appetite"). In other words, n, represents the number of programs scheduled for broadcast to a 
particular customer at any time. In a preferred embodiment, n, corresponds to the number of "virtual" channels 
available to each customer. 

P 1 ? 1 ! a- J" at * ordanc *; with ,he Mention, the "greedy" scheduling algorithm will function as follows As 
illustrated in Figure 2, at each step of the algorithm: "onows. as 

il^JSlT*^ 1 WhiCh ^l dS the 9reateSt satisfaction ^er the current customer population (ie 

those eligible to receive program j). y p 

2) Decrement the viewing appetite n, of those customers who have the greatest agreement scalars with the 
currently selected program. 

3) Remove from the customer population any customers whose viewing appetite has dropped to zero (n = 0) 
since they have all the shows they need and hence are not a factor in selecting further shows. ' 

Sls avItebf^X ^ nUmb6r ° f Pr ° 9ramS Se,eCted ' m ' ^ the ° f ^oadca, 

[01 03] A more precise description of the process may be described in pseudocode as: 

0) Initialize: 

Let apj = n, for all i set all customers' appetites to n, 

Let V = {I} initialize the customer population to include all customers 

Let m - 0. start with no programs selected 

As described below different initializations are possible to account for programs which will always b« 
broadcast. Also, if ind.v.dual appetites are not available, all can be set to a single value n. Y 

1 ) Select the currently most popular program: 

Select program j which gives max £. |nv ac^ 

Let m = m + 1 increment number of programs selected 

If m = M, stop, stop if done 

otherwise proceed to (2). 

2) Decrement the appetite of those customers who like the currently selected program: 

Select the customers i in V for which a C|j is above a threshold value a Then decrement the appetites for 
selected customers by letting ap, = ap, - 1 . 

3) Remove customers from the current customer population V who have no appetite left: 

Remove from V customers j for whom ap {j = 0. Go to (1 ). 

Fh r0 t dl T S 3 in WhiGh 3 Vari9ty 0f P r °9 fams are se,ected according to the spread of 

customer interests. Note that the s.mpler algorithm of selecting the most popular programs (those with high4 
agreement matrices) will not produce acceptable results, for if a majority of customers orefe action film* thfn 
only action fOms would be selected, leaving the minority of customers with no films * 
[01 04] Figures 1-3 summarize the above-menlioned procedures for establishing customized channels of preferred 
programs in accordance with the invention. preierreo 

[0105] As illustrated in Figure 1, a schedule of available shows and their characteristics (content profits) 
,s created and stored in a database at step 102. As noted above, the characteristics of the shows may be detJS 



by experts" or test groups by completing questionnaires and the like, or the content profiles may be generated from 
the frequency of usage of certain words in the text of the video programs (the on-line descriptions or the script) 
Alternatively, the content profiles may be determined by combining the customer profiles of those who "liked" the 
video program during a "rave review. " Preferably, the content profiles are downloaded all at once for a given time 
period along with the corresponding scheduling data as part of the electronic program guide data and sent via a 
separate data channel. On the other hand, the content profiles may be transmitted as part of the bit stream of the 
video program (for digital transmission), in the vertical blanking intervals of the video program (for analoq 
transmission), or by other appropriate means. ' 
[0106] At step 104, the customers' preferred characteristics (customer profiles) are created and stored in a 
database. As noted above, the customer profiles represent the customers' preferences for the program characteristics 
and preferably differ in accordance with the time of day to account for different moods of the customer and 
different customers within each household. In a preferred embodiment, the -customer profiles for each household are 
stored in the set top multimedia terminal for that customer's household. 

[0107] The content profiles received with the electronic program guide data are preferably stored at the set 
top multimedia terminal and compared by the set top multimedia terminal to the customer profiles for each customer 
An agreement matrix is then created at step 106 using the techniques described above. Once the agreement matrix 
has been generated, those programs with the highest values for ac, i.e., the closest distance (1/ac) and hence 
closest match to the customer's profile or profiles, are prioritized and selected for presentation as "virtual 
channels" (in the case of creating Virtual channels" at a set top multimedia terminal) or as the programming 
channels (in the case of scheduling video programming at the CATV head end) at step 108. This process is described 

in more detail herein with respect to Figure 2. . ~ ~ 

[0108] In a simple embodiment of the invention in which no feedback is used to update the customer profiles no 
further activity is necessary. However, it is preferred that the customer and/or content profiles be updated to 
allow for changes in the customers' preferences as well as to correct errors in the original determinations of the 
profiles. Accordingly, at step 110, the customers' set top multimedia terminals maintain a record of the video 
programs that are actually watched by the customer for a period of time (say, 10 minutes) sufficient to establish 
that the customer "liked" that program. Of course, the monitoring function may be selectively activated so that the 
profiles are not always updated, as when a guest or child takes control of the television at an unexpected time. 
[0109] Finally, at step 112, the customers* profiles are updated to reflect the programs actually watched by 
the customers. Such'updating techniques are described above and further below with respect to Figure 3. 
[0110] Figure 2 illustrates a technique for selecting video programs for "virtual channels" at the customers' 
set top multimedia terminals or, alternatively, for scheduling video programming at the head end from the available 
video programming sources. As illustrated, the method is initialized at step 202 by determining which customer 
profile or profiles are actrve for the time period to be scheduled, by determining the customers' appetites (number of 
channels available for transmission), and by determining the database of video programming from which the schedule 
may be created. For example, at the head end, the video programming database may be any video programminq 
available for transmission during the designated time frame, while at the set top multimedia terminal the video 
programming database comprises only the video programming on those channels which the customer is authorized to 
receive 



[0111] Once the agreement matrix for the available video programs has been determined, at step 204 the most 
popular programs for a single customer (at the set top multimedia terminal) or a cluster of customers (at the head 
end) are selected and removed from the list of available programs during the relevant time interval Of course in 
the case of scheduling at the set top multimedia terminal, the video programs scheduled onto "virtual channels" are 
still received on their regular channels and the "virtual channels" are assigned to unused channels of the set top 
multimedia terminal. At step 206, it is then determined whether the customer's appetite is satisfied (at the set top 
multimedia terminal) and whether all the customers' appetites are satisfied (at the head end) If all relevant 
appetites are satisfied, the scheduling algorithm is exited at step 208: On the other hand if all customer 
appetites are not satisfied, the appetites of the customers likely to watch the selected program are decremented at 
step 210. Hence, only those customers with preferences which relatively "match" the characteristics of a particular 
video program have their appetites decremented. At step 212, those with no appetites (channels for 

qqKoHi i linn \ loft 

are removed from the scheduling list. The process is then repeated starting at 'step 204 for those with channels left 
to schedule. 

[0112] When establishing "virtual channels" in accordance with the invention, it is important to know which 
customer profile or profiles to use in creating the agreement matrix. In a preferred embodiment, this is 
accomplished by using the customer profile or combination of customer profiles which are given priority during a 
particular time interval for a predicted customer mood. This determination is made independent of the person 
actually viewing the television. However, the system of the invention may be easily modified to permit the customer 
to identify himself or herself by providing a user ID to the set top multimedia terminal so that a particular 
profile of that customer may be selected in the determination of the agreement matrix. In other words custom-r 
names may be matched to particular profiles based on selections made when that customer's user ID has control of the 
television. In addition, combined profiles may be created which best reflect the combined viewing tastes of several 
persons in the same household. On the other hand, the system may come with preselected profiles which the customer 
may select to use as his or her initial profile. After a certain amount of time, the system would recognize a 
particular profile as belonging to a particular viewer or combination of viewers so that it would eventually be 



unnecessary for the customers to input their user IDs. In other worts, the system would "guess" which customers are 
viewing by noting which customer profile is closest to the shows being selected. Of course, this latter approach 
requires the customer profiles to be matched to individuals rather than just time slots as in the preferred embodiment 

E123L a T? ? J " U f! ra,eS , a , pref ^ ed technia - ue fo r ^dating customer profiles in accordance with the 
invention^ As illustrated, the initial customer profiles are selected at step 302 using any of the techniaues 
described above. At step 304 the agreement matrix is calculated to determine which vide 9 o ?gl"K 
might desire to view in the selected time period. Then, at step 306, the passive monitoring feature of the invention 
is invoked to determine t the customer actually watched the video program selected from by the agreement mS If 
the customer watched the predicted program, then the customer profile is presumed accurate at step 308 and no 
adjustment is made. 0 course, the customer profile may be positively reinforced by varying the adjustment 
mcrement However, if the customer did not watch the predicted video program, the customer proffe forThe 
appropriate time interval ,s selected at step 310 which has characteristics closest to those of the video program 
actually watched. That customer profile is then adjusted using the techniques described above. The adjusted S 
is then considered valid until the next time slot is encountered at step 312 The agreement matrix is hen 
recalculated at step 304 for the new customer profiles and video programs offered in the 'next tSot 

L 01 ! 41 a •„ P ^ CUlar u ha J rdW f e im P lementations of the invention in a set top multimedia terminal and/or a video 
head end will be described in Section VI below. 

C. Scheduling Variations 

[0115] ^Many variations to the above generalized scheduling scheme are possible within the scope of the 
invention. The following vanations may be used by those skilled in the art, but, of course, this list is not comprehensive. 

A^ Special programs such as standard network broadcasts may be included in the scheduling. In this embodiment 
when certain programs have already been scheduled for broadcast, such as standard network programs or specially 
selected popular mov.es, the above algorithm is modfied to account for the effect of these programs on customer 

^r^'moHifTf'' 11 ' 19 V ! de ° P r °9T ming - This is easily done ^ initiall V ™ nin 9 the above algorithm with 
f! P ™ dlfied t0 Slmp| y ' nclude the Prescheduled programs rather than selecting new ones When all 
prescheduled programs have been "scheduled" (i.e., customers likely to watch the prescheduled programs have 
been removed from the customer pool and the broadcast slots have been filled), then the scheduling algorithm 
proceeds as usual. As desired, this will lead to additional movies being selected which will appeal mos to 
customers who will probably not be watching the standard network broadcasts. 

n^ Th f^ eCt 0f . recent broad ( ca sts may be included in the scheduling. The above scheduling algorithm is 
presented for a angle time slot. In actuality, the video programs selected must depend on which other video 
programs have been shown recently. This can be done in several ways. For example, one can remove rec-nUy 

fmTtho ?^ e of " St f^Tt aVa !' able t0 broadcast - Alternatively, one can remove recently shown movies 

from the list of movies available to broadcast when their popularity (number of customers per broadcast) drops 
below a threshold. This approach is better in that it allows new hits to be broadcast multiple limes More 
complex models may explicitly include a saturation effect by shifting the agreement matrix based on the number 
of similar video programs recently viewed. ^««un renuraer 

a The effect of overlapping time slots, such as so-called "near video on demand" may be included in the 
scheduling. For this purpose, the above algorithm can be modified to account for the fact that popular video 
. ma V 0CCU Py .™ re 'nan one time slot and that time slots may overlap. As an extreme example it may be 
desirable to offer multiple overlapping broadcasts of a popular movie on vacant channels, e.g., at 15 minute 
intervals, n this example he customer appetite concept set forth above would be augmented by a more 
sophisticated mode which includes the fact that customers turn on the television at different, possibly random 
times, and only want to watch shows which are starting at times close to the time they turn on the television 

S;l h Z 6ffeCt °J T 0d , S ^ be , included in the scheduling. If one has different agreement matrices for different 
J°f r n moods (erther reflecting multiple customers with different tastes using the same television or 
2£5S tk ^ T , n9 ffere ? ( VISWing P references due t0 ™od), the above algorithm can be trivially 
t h2 n * ,T n u m °°« S a .u ! USt trSa,ed 35 different homers, with the appetite for each mood selected 
£inn EKi h? K W f ° ( K Sn fl ] at ( m , 00d ° CCUrS in the time Sl0t bein 9 scheduled. This will result in programs 
being scheduled ror each of the potential moods; the customer can then pick his or her preferred show. 

E. Programs may be matched to channels for scheduling content based channels. Customers may prefer to have 
SoJTrnS h J oonsistent content or style (e.g., sports or "happy" shows). The basic algorithm presented 
above could be modified so that some slots are reseived so that shows of a given type (e.g. close to a given 

scheduSn! TSo^hm * ^ " be Seleded 1 ' hey haV9 not alread y been chosen in the ™" 

F. How the customer appetites are decremented may be selected for scheduling purposes. In the above scheduling 



agorrthm, whenever a program is chosen, the viewing appetite of all its "audience" is decremented, and those 
who have used up their appetites are removed from the viewing population. Since the checking of viewing appeSe 
h T' y °i t f he , audience °J. the currently chosen program, the outcome of the scheduling process depends n 
part on the definition of audience - who will potentially watch the program. Above it was assumed for 
simplicity that customers should be included in the potential audience if their agreement with the program 'was 
above a threshold. However, many variations on this are possible. For example, one could pick a fixed audience 
size for each show. For example: auuwnce 

Select the customers i in V for which ac,, is maximum. n w can be calculated, for example, as n*l/M as the 
average customer appetite, n, times the number of customers, I. divided by the number of programs M to be 
broadcast. One could also make the threshold a variable, either by decreasing the threshold or the number 
of customers n w over the course of developing a schedule so that the first programs scheduled would have 
large audiences, while the last programs scheduled (those done after most customers needs are satisfied! 
distributbns audiences - ldeall V. "^s would be done in accordance with observed audience size 



JLE Ti i t may be included in the scheduling. The algorithm could be modified to account for higher 
viewing levels in shows which follow .mmediately after popular shows (the "lead-in" effect). As described above 
this can be accounted for by treating errors in predicting customer behavior differently if they result from the 
customer remaining tuned to a channel. . y me 

H. The effect of repetitive showings may be included in the scheduling. Since viewership is a strong function of 
the time of day and of the day of the week, one cannot assess the popularity of a show based on the number o 
peope watching it without controlling for the time slot. Similarly, movies shown at the same time a? v4v 
popular programs or as very similar programs tend to have fewer viewers - the audience will be divided The 
algorithm given above does not rely on absolute viewership numbers and so does not have these problems 
Similarly when many similar programs are shown over a short span of time, there is viewer "burnout"' In other 

words, if the same move is shown repeatedly over- the course of a month, it will get fewer viewers on late 
showings. As another example rf many golf programs are broadcast during a week, each customer's desire to watch 
golf will saturate, and viewership will decay. Predictions of what a customer will want to watch only makes sense ff 
they have not watched the same (or almost identical) show recently. Thus, changes to the customer profiles and 
content profiles should not be made if, for example, a customer does not select a movie to watch which they recentlv 
watched. However depending on the indexing scheme used to store viewing habits, checking to see if a similar 
program was recently watched, while straight forward, may require a significant amount of database search. 

I. Customer profiles can be modified on an individual basis. Since different people often watch the same 
television, and most feedback devices in popular use do not recognize which customers are present customer 
Preferences cannor oe characterized by a single agreement matrix. Also, customers may have different agreement 

El Pe TT \ m00d ; lf u m ° re than ° ne dement matrix per television is initially estimated (e g 

by interviewing multiple customers), then the above algorithm can be modified to only count a prediction as 
wrong rf none of the agreement matrices for a given television yields predictions that agree with what was 
actualy watched. One or more of the agreement matrices for the television could then bl updated using The 
algorithm This is not ideal, in that one does not know which mood (or customer) was present but the best that 
one can do ,s assume that it was the mood (customer) whose agreement matrix- came the closest to grving the 
correct prediction. On the other hand, the customer may simply identify himself or herself when the televison 
rs urnec Ion, and preferably may specify which profile to use based on who is present and/or the customer's mood 
All of these effects can be taken into account in developing customer profiles and content profiles and in 
scheduling video programs in accordance with the invention. H ™ In 



V. Passive Characterization of Customers Using Feedback 



[0116] Thus far, the invention has been described in the context of a "filtering" system in which all of the 
video programming available at the head end is scheduled on "customized" channels in accordance with the customer 
profiles of customers and in which a subset of the programming on the "customized" channels available to each 
customer is selected using an agreement matrix for presentation to the customer as "virtual channels" tailored to 
charactenstic profiles. However, one of the more interesting applications of the above-mentioned 
customer profile system is that the same customer profiling system may be used to provide feedback from individual 
customers regarding what characteristics they find most desirable in the broadcast shows. By obtaining S 
information, the customer profiles may be appropriately updated as described above. As will now be described the 
S 1° SKTT ed ( U ' eS 3 !f° may b ! Updat6d t0 reflect the comers' actual preferences, and information may 

5£E£XSJ£ ZT^S raph>cs and customer profi,es t0 provide ,ar9e,ed advertisina and ^ sh °p 



f J? l. f »u Y tUrS ° f many video/cab| e television installations is that it is possible to obtain active 
feedback from the customer: either simply what was watched at each time or rrire cStelv how mucnlh* 
customers in their estimation) liked what they saw. Monitoring viewing patterns to hereJ ^wS 

invention does not rejure an^customer actions. A^wil ^1^X2^^ 
characterizations of customers' preferences for proarams which in ti.m 1 « ? * _T im P rove 

ofP-gram^^ and scheduling 

SnSf n nfl f 3 -? CUS,0mers ind j catin 9 wnich video P r °gram characteristics they prefer can be combined with 
f„ fT*"* J 86 * 4 ** can be used both to im P™e individual customer profiles and to improve customer 

[0120] As with methods for updating individual customer profiles one can characterize clustprinn rwh^ f 

those characteristics, uses no measurement of what is watched or other feedback HnweCJ fc . i ? 1 * 
method can be supplemented with customer ratings of mov^^ 

[0121] The technique starts, optionally, with a profile of the customer in terms of what movie characteristic 
thl 5.SLEJ important. It then refines the importance grven co different ch«cto^^i3hSrS35 
the characteristics predict what movies the customer actually watched. accurareiy 

A. Algorithm for Passive Updating of Customer Profiles 

[01 22] The same notation as used above will be used here to describe the methods for using passive feedback to 
improve profiles of customers, movies, and customer clusters. Namely cv. is the amount ^dSSSS f?h2 
customer , desires, wv. is the importance of characteristic k to customer i, and cp y , is the degree to which movie J ha 
characteristic k. For notational simplicity, it is assumed below that the weightings are normaLd ( Z| wv* = Z> £ 
the customer weightings add to one. There are: J movies, K characteristics, I customers M "expert? 1 and P movie, to 

?™7jj:* c ^r and ,ts importance ,o the customer using E ^ a,i ° n <*> - forth ess 

[0123] Given a set of J movies available with characteristics cp ik , and a set of customer preferences cv 
customer i would be predicted to pick a set of P movies to maximize: "trices, cv ik . 



3 xn the 
best P of J 



iccuJLv n mk ' k ! rSnt SS ' ° f P m0V ' eS than WaS Predided ' CV and w * should be *i^ed to more 
accurately predict the movies he or she watched. In particular, cv and w* should be shifted to reduce the match on 
movies that were predicted to be watched but were not watched, and to increase the match on movies Z were 

ThSre ~ — ^ - * this. One is to sJ^cvT eaStmng 

cv * = cv * - A(cv jk - cp Jk ). Equation (24) 

2I2L JJl^fT!!" ^ ^ by maKing CV C ' 0Ser '° cp * A is P° sitK/e • and is representative of the case 
where the algorithm failed to predict a movie that the customer watched. The size of A determines how many exampJ 
movies one must see to replace what was originally believed. If A is too large, the algorithm will be uniaSe but 
for sufficiently small A , cv wil, be driven to its correct value. One could in theory aiso make use o the S ma't 52 



above algorithm will decrease the match if a is negative, as for the case where the algorithm predicted a movie that 
the customer did not watch. However, there is no guarantee that cv will be moved in the correct direction in that case. 
[01 26] One can also shift w /Vf using a similar algorithm: 



Equation (25) 



2;k-a|civc P .j) 

5 As before, this will increase the match if a is positive, as for the case where the algorithm failed to predict a movie 
that the customer watched, this time by decreasing the weights on those characteristics for which the customer 
profile differs from that of the movie. Again, the size of a determines how many example movies one must see to replace 
what was originally believed. Unlike the case for cv, one also makes use of the fact that the above algorithm will 
decrease the match if a is negative, as for the case where the algorithm predicted a movie that the customer did not 

70 watch. The denominator of Equation (25) assures that the modified weights w^ still sum to one. 

[0127] Both cv and w,* can be adjusted for each movie watched. When a is small, as it should be, there is no 
conflict between the two parts of the algorithm. 

[0128] There are several ways to initialize the algorithm, depending on what information is available, including: 
15 (a) questioning the customer as to what characteristics they find important in movies or other programming; 

(b) using a customer profile typical of the other customers with the same demographic profile as the customer; 

(c) using a typical customer profile (assuming no demographics are available); and 

20 (d) random selection, which is not desirable unless a solid history of movies is available. 
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[0129] The example set forth in Section HI above will now be extended to illustrate-the above-described 
feedback process for improving the characterizations of the customer preferences for the programs. 
[0130] As noted in the example in Section III above, the customers' profiles (cv', sv\ wv f ) are initially 
estimated. The estimated initial normalized weight (w 1 ) and the agreement scalars with all the programs are then 
calculated accordingly. During the customer feedback process, each time a customer picks a program which differs 
from the program that is predicted based on the current estimated agreement scalars, corrections are made to the 
estimated characteristic and weight profiles, which assumably will move the estimated profiles closer to the true profiles. 
[0131] . if it is assumed that at time period n customer i watches a movie j but the algorithm predicts program h, 
corrections will be made to customer i's characteristic profile as follows: 



and to the weiaht Drofile as follows: 



cv, fk n+1= cv , ik n.A(cv , lk ".cp' jk ),forall k, 



w' 



~, for all k, 



» 1a w V- A ( cv V-cp' jk + cp' hk 

I.W'^-AKn.cp'. + Cp;,) 

where the positive parameter a determines the size of the correction step. 

[0132] For instance, an initial estimated customer profile (CV°) could be found using a random function to be: 



customer 


romance 


high-tech 


violence 


1 John 

2 Mary 


1.595 . 
6.735 


9.894 
3.897 


9.174 
0.000 



If it is then supposed that there are three television channels and programs 1, 2, and 6 available for broadcast 
John (customer 1) chooses program 1 since it has the highest agreement scalar with him, based on his true customer 
profiles, among the three available programs. However, according to his current estimated customer profile, program 
6 was predicted. Hence, a correction is necessary. For simplicity, changes are made only in his current estimated 
characteristic customer profile cv', a to create his new estimated profile cv'/. In other words, his weight profile will not 
be adjusted. Thus, if a = 0.1 , the customer profiles are adjusted to: 

cv' n 1 = 1.595-0.1 * (1.595-2)= 1.635 

cv' 12 1 = 9.894 - 0.1 * (9.894 - 9) = 9.805 
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cv' n 1 =9.174-0.1 * (9.174 -4) = 8.855 



[0133] The following represents a tvoical feedback 





Initial Estimated Profiles 


5 


1 John 


3.399 


10.000 


5.448 




2 Mary 


10.000 


2.190 


1.216 



Run 1 (92% correct predictions): 
Estimated Profiles -— — 



1 John 


1.943 


9.760 


5.941 


2 Mary 


10.000 


2.190 


1.216 
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Run 2 (97% correct predictions): 
Estimated Profiles " ~ 



1 John 


1.597 


9.735 


6.139 


2 Mary 


10.000 


2.190 


1.216 
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Run 3 (100% correct predictions)- 


Estimated Profiles 


1 John 


1.597 


9.735 


6.139 


2 Mary 


10.000 


2.190 


1.216 



3 5 ^= 100 -oops (time periods) . 

[0134] By way of summary, the foilowing steps shou.d be provided for pasL feedback of customer preferences- 

oSc^ 

o Pjk and w! SncT 3 CUSt ° mer ' * > *" "* M not ^ ^ watch, update cv„. 

cv ik = cv ik -A(cv ik -cp. k ), 
and/or % = ' V A(cv ik - c Pf ). 

w ik = (w ik - A |cv ik - cp jk |) / S k (w ik - A |cv jk - cp jk |). 

- ^^5^52^ z:st us"g ovie j that the a,90rithm predicts ,hey wouid watch <*« °* * «- 

, . W * = K + ^K-cp j k l))/I k (w ik -A|cv |k |-cpJ). 

B. Variations to Passive Characterization Algorithm 
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[0135] Many variations to the passive characterization algorithm are possible. For example, the 

characterization of the movies was assumed to be correct in the above feedback system. As a result, the customer 
profiles were adjusted. However, if one believed the customer profiles and not the movie characterization, one could 
use a similar algorithm to adjust the movie characterizations. In practice, one would use both methods 
simultaneously: if predictions for one person are less accurate than average, that person's profile should be 
adjusted, but if predictions for one movie are less accurate than average for movies, that movie's characterization 
should be adjusted. 

[0136] The simplest algorithm is to shift cp for each wrong prediction for customer i and movie j using: 

°Pjk = °P } k ' ' c P Jk ) Equation (26) 
This will increase the match by making cp closer to cv if a is positive. As before, this should only be done for the 
case where the algorithm failed to predict a movie that the customer watched. 

[0137] Another possibility would be to not use any characterization of the movies or customers, but simply to 
group the customers together based on the number of movies that they viewed in common. This requires overcoming 
some minor technical difficulties in controlling for different numbers of movies being watched by customers On the 
other hand, if active feedback on how well the customers liked the movies is available, one could also use this 
information. Rather than making changes simply based on "watch/did not watch", one can weigh the changes (ie 
alter the size of the a's) based on the degree of customer like/dislike of the program. In addition, one can also 
modify the algorithm to take into account other known determinants of customer behavior. For example, customers tend 
to continue watching programs which are shown onthe channel that they are currently watching. This means that even 
if the agreement matrix correctly predicts that a customer would prefer a program on a different channel the 
customer may not discover the other program (or bother to change to it). In this case, the agreement matrix was not 
incorrect and so the customer and content profiles should not be altered. This can easily be incorporated into the 
passive feedback algorithm by using smaller changes (smaller a's) when a customer remains tuned to the same channel 
(possibly virtual channel) than when they simply switch channels. 

C. Customer Clustering 

[0138] As noted above, customer profiles can be kept for groups of customers as well as for individual 
customers. Grouping customers together into customer clusters offers several advantages. Most importantly, if the 
clusters are accurate, improvement of customer profiles will be much faster, since far more movies are viewed per 
week by a cluster than by any individual in the cluster. Clustering also provides a means of setting up an initial 
profile for new individuals joining a video service in accordance with the invention, as they can, as a starting 
point, be given a profile based on demographic data or on surveys they fill out. 

[0139] There is a long tradition of clustering people based on demographic or other data, and many clustering 
algorithms exist ranging from traditional methods such as factor analysis or the k-means clustering algorithm to 
more esoteric neural network-based methods such as Kohenen networks. Any of these can be used for the task 
described here, but the present inventors prefer the k-median clustering algorithm. Clusters can be formed based on 
(1) what programs people watch, (2) what features of programs customers rate as important (e.g., how similar their 
agreement matrices are), or (3) a combination of programs and features. One can also include demographic or 
psychographic customer profiles or other information. 

[0140] The clustering mechanism selected must address several technical issues. Most importantly, the 
clustering algorithm must take into account the fact that different attributes used for clustering may have 
different degrees of importance, and may be correlated. If one uses as a clustering criterion a pure measure, such 
as maximizing the number of programs watched in common, or maximizing the degree of similarity of the customers' 
agreement matrices, this is not a problem, but if these attributes are combined with other information such as 
demographics, the algorithm must determine an appropriate metric, i.e., combination weights for the different 
measures. 

[0141] Once clusters have been determined, they can be used in several ways. As the profiles for the clusters 
are updated based on what the customers in the cluster watched, the profiles for the individuals in the cluster can 
be similarly updated. Thus, customer profiles can be updated both based on what they watch and on what customers 
with similar tastes watch. These modified customer profiles would be used for determining virtual channels and for 
scheduling which movies to broadcast. 

[0142] As noted above, the purpose of clustering is to group objects with high' similarity into clusters. In a 
multichannel cable television system, individual channels are often devoted to their specific "audience", or to a 
group of customers who enjoy relatively homogeneous preference profiles. Prior to the design of the features of the 
channels, it is thus necessary to recognize these customer groups as well as their collective profiles. 
[0143] There are three basic approaches towards clustering in accordance with the invention: hierarchical 
methods, clumping techniques and optimization techniques. Hierarchical methods fall further into two types - 
divisive or agglomerate. In clumping techniques, some objects may belong to several groups simultaneously 
Optimization techniques, such as the k-means algorithm and the algorithm for the p-median problem, takes the form of 
linear programming in an iterative approach. 

[0144] The two algorithms presently preferred for cable television applications are: a hierarchical method 
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53£i w£fc^ is est,rna ted . and a revised p- me dian 

[0145] The process of clustering customers in accordance with ^invention is combed offhretphases: 

1 ) Estimating the distribution of the customer population; 

2) Clustering the customer population; and 

3) Determining the correspondence of the video program categories to the customer clusters. 

[0146] Estimating the distribution of the customer population is neces^rv h^,,^ ^ . . 

parameter used to cluster the customer population depends on the ™fl«f f l Valu , eof an adjustment 

Spin -P^'S. 6 ,^ •. e**n o, 

that resist analysis by classical methods based on e t~" a h D .^ complex phenomena 

skilled in the art. hierarchical clustering meSds generat^ "tl ZZV^?^ E" 0 * AS kn0Wn «* those 
merging (agglomerative) or splitting (dLSnfof dusterl Thf voe V n f H,? t 008 by J T !? anS ° f 9 SUCCessive 
determination of similarity trees where the number f gmups q f^L^ILSSS^T^ T'T^ b ,he 
where a represents the degree of "belonging" of an element ir >s a monot ° n ° us, y as the value of a increases, 
beyond the minimum membership vZ with 1 rB^rTS^^."? 0 ^"^ ,to ^° f « 9 ° eS 
scattered population, break up at low levels of J "wlff Sfal* TlZ ' ' V « rmed gr ° Ups ' typical| y found in a 
dismantle only a. high levels of «. Con^elVl^^i^ n T P9, ^ h " P ° PU,ation ' 
yields a concave curve and the latter gK.es a conl curve " " °" X q ° P the y fhe fo ™ 

complctness 6 C ° mpadneSS ° f 3 P ° pulation ™* b * established using the following equation, where C is a measure of 

J o 



<7(2) 



Equation (27) 

This equation, of course, is difficult to calculate. It is thus preferable to use its discrete version: 



35 a„ 



a. = a, 



g («„) 
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Equation (28) 
where h B = «, - oc,,, the interval of a . A typical setting is a 0 = 0.1, a = 1 0 and h =01 
- ' [0149] For the membership function of the similarity relation, p R (ij) "which indicates the simile, k h 

custemers , and , and an agreement scalar, ac, similar to the one^fil ^^^"Sl^ 

,. , „ „ . Equation (29) 

((=1,2,... Aty=1,2....AQ 

where: 
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tijk = I <c 1Jb - c Jk ) | / s czk . cjk if c lk zo 

( i = 1,2 A7; J = 1 , 2 , . . . , N; k = 1,2 A") 



Equation (30) 

and: 



s c,*-c/k = -yj ( V + s 2 ,*) / (A/ - 1 ) Equation (31 ) 

where: 

ac,j is the agreement scalar between the profiles of customer i and the profiles of customer j; 

t, jk is the t value for significance of the difference between the rating of characteristic k by customer i and by 
customer j; 

c ik is customer i's rating for characteristic k; 

s jk is the spread (flexibility) in customer i's rating for characteristic k; 

Scik-cjk is the standard deviation between the distribution of c lk and that of c jk ; 

wv, k is customer i's weight of characteristic k; 

Wj is wv lk , i.e., the sum of all weights for customer i; and 

N is the sample size. 



[0150] One is added to the denominator in Equation (29), so that 0 < ac < 1, which is the range required for a 
valid membership value. 

[0151] The actual clustering is done by a revised p-median algorithm. Traditional p-median clustering 
algorithms require the prior knowledge of p, the number of clusters, a requirement often difficult to meet in 
reality, especially in an application of the type described here. Accordingly, a modified p-median clustering 
algorithm is preferably used which introduces p into the objective function, thereby eliminating the iterative 
nature of the algorithm and overcoming the difficulty of guessing an initial p value. 

[0152] In the population clustering method in accordance with the invention, the grouping of objects moves in 
the direction of minimizing dissimilarity between the objects, where dissimilarity between objects is indicated by 
some measure of "distance" between them. In other words, for a population V that contains N customers, any customer 
can be described in the system by a vector of K characteristics. Thus, for any two customers i and j : 

v i = [ c ir c i2- ■» c .k' • • • c {k] 
V j = [c j1 ,c J2t ...,c jk c. K ] 

which in fact are the preference profiles of the two customers, which are determined as set forth in detail above 
In general, the distance between the two customers i and j is defined as: 

. d ij = A«f tfCfl, c jk ), Equation (32) 

where f k (c lk . c Jk ) is the individual measure of distance for each characteristic. The distance may be determined in 
several ways such as absolute "City Block" distance, Hamming distance, Euclidean distance, etc. A major drawback of 
using these measures in the present invention, however, is that they fail to recognize the spread (flexibility) in a 
customer's rating for the characteristics. In order to take the flexibility into consideration, it is necessary to define: 

^=E,(^-)g/K 

Equation (33) 

(/= 1,2...N t y=1,2',...A/J f 

where w jk , W„ and t, jk are defined as above. Distance d tj is somewhat like the reciprocal of the agreement -scalar sc fj . 
[0153] In order to incorporate p, the number of clusters, into the objective function so as to optimize p, its 
coefficient should reflect the nature of the entire population. The global mean used in the formulation of 'the 
problem is thus: 



1 7 

n 2 



Equation (34) 

where d, * 0. for ij, i*j; and d, = 0, for i,j. i=j. The objective function is the minimization of the total sum of the 
distance between the .customers for the case where each customer is assigned to exactly one cluster 
For tLrm^H \ U a ( °; 5 ( - a iS 30 ad j ustment P ara ^ter. For normal populations, its value is 1 

nno„f a ?ion^K y ! ^ populatlons ' however - its valu e ™V change. For instance, in a highly scattered 
population the value of a may .ncrease so as to create highly distinguished clusters. Therefore, « is aSE 

SSfJIl i / n c ° mpadneSS measure defined in E q ua »°ns (27) and (28) above. A possible form of the function 
could be a = (3 / C. where p is a parameter whose value is determined through calibration 

t0 h 15 fJ k de f lgni ?9 of a " virtual " video channe| . which is oriented towards one or more customer qroups 

f a h °" d r ?! f perf0rmed V h3 l thecombined feature ^ of the program categories it carries match the pre eSnces of its 
Sir CUSt0 K merS J' ls ' herefore im P° rtant t0 kn °w the correspondence of various video categories to the customer 
clusters obtained after running the clustering algorithm set forth above. One way To determine the 

inTe K?r n The t2^ f M, ? B J!! f d 3 duster is to calcula ' a the "membership" of.hTSego^ 

in trie cluster. The membership function for Category 1 in cluster i is defined as: 



(-^) w 



Equation (35) 

where d„ is defined as in Equation (33). M is a weighting parameter (M > 1), which reduces the influence of 
tte mTreSbn * P "' * (CUS '° mer ^ '° The m0re M is greater ,han one - 

D. Creating Initial Profiles From Clusters 

Lmffe, Fnr^v^f * .^ ec '' on a b°ve, there are several methods for determining initial customer and content 
hlr tt , 9 • ^ custo _ mer P rofll9S ™V b e established by. having the customer select a few of his o 

^<n™ T' 83 , 0 ' ' e ^ vlslon ( 5 hows and 'hen using the content profiles of those movies or shows to constmct a 
customer profile. In addrtion, the initial customer profile may be based on replies to Questions «d«5rf7h! 
customer, or conversely, the customer may be assigned a customer profile typfeal of peop e mZTL1EZ££ 

Z U LIT 3 ?' In, " al "TS, Pr0f :' eS may be eS,abHshed ^ usin 9 ratings by experts or tes? group TEStS 
the degree of presence of different characteristics or by using the relative frequencies of words in movie reS 
or closed captioned listings and the like. However, if is often useful to use data indicating which pragrams S 
vjewer nas wacnea in order to determine the initial profiles for either new customers or new programming' 9 

222 =h * *' 1 1 CUS !??u r Pr ° fileS ° f n6W customers sh ou'd look like the content profiles of the movies 
and/or shows they wateh, and the content profiles of new movies should look like the customer profiles of the 
customers who watch those movies. If each customer has a single customer profile, the method ™ de ermhinq he 
customer profile rs s,mple: one simply finds the centroid of the content profiles of all he moZs T and™ shows 

XV, LSr h , V h ° W ' thS m ° VieS Walched ^ a customer must be clustered info groups for selectbn 

of the centroid average) of each group. Similarly, if one has a list of people who have watched a movie or show 
one can determine a content profile for that movie or show by clustering the profiles of the customers and selecting 
the profile cluster containing the most customers. customers and selecting 

[0158] By using clustering techniques, one can also determine an initial customer profile even if no historv of 
ntrhnn £SE H preferenceS , is avai,able - in ^u\ar, by clustering customers ba'sed on demograote o 
psychographic data, new customers may be assigned customer profiles typical of customers with similar demoaraohics 
or psychographics. On the other hand, when no characteristics are known for movies or custome fa™!^ 
ma.™ indicating which movies each customer is likely to watch may be computodTom a record I o^whlch mov^eaS 



customer has already watched. As described above, this agreement matrix can be used for selecting a set of virtual 
channels for each customer, for scheduling movies for delivery over a cable or equivalent transmissfon Stem and 
for making move rental or other rental or purchase recommendations at a kiosk or personal computer (described 
below). The key to generating the agreement matrix using this approach is the observation that if two peopf have 
liked many of the same movies or shows in the past, then they are likely to continue to like simflar movies or 

S^J^h f f 5 haS S6en and ,iked man y movies or shows which a sec °nd Pereon "B" has seen 

and liked then A is likely to like other mov.es or shows which "B" liked. The method set forth below generalizes th£ 
concept to multiple customers. a mis 

[0159] In the simplest use of clustering, a record is kept of all movies or shows watched by all customers If 
he customers are not identified they are identffied by whether or not their television is on. The customers are 
then grouped so that people who have watched more movies or shows in common are more likely to be in the same 
group. In other words, the customers are divided into groups to minimize the sum over all the groups of the sum over 
all pairs of group members of the distance between the members. Practically, this means that the distance from the 
centroid of he group .s computed since it is cheaper to compute. Since the inverse of the distance is a measure of 
agreement, the clusters are preferably selected to maximize agreement among the cluster members 
[01 60] Once the customers have been clustered into groups, the effective popularity of movies or shows for the 
cluster can be determined by counting the total number of times each movie or show was watched An- agreement 
.matrix between the customers and movies or shows may be constructed based on these clusters by assiqnino each 
customer the agreements ("effects popularity") of the movies or shows for the cluster that the customer I in 
where all members of a group have the same agreement. ^romer is in, 

[01 61 ] In particular, a technique for creating initial profiles from cluster data includes the steps of: 

(1 ) picking the number of desired groups. K; 

(2) using the k-means algorithm to group the customers into K groups to minimize the sum over all the groups of 
the sum over all pa,rs of group members of the distance between each group member and the group centroid In 



other words, it is desired to minimize 

clusters < 
k=l to K i in k 



clusters customers 



Equation (36) 



where |v, - v k | is the distance between the vector of movies watched by customer i and the centroid of cluster k v is 
E!did°ean^L^ ^ CUSt ° mer ' and Zer ° for each movie not watchsd - The simplest measure is the 

\\\ - v k | = sqrt ( £ (v in ~ v k J 2 ) 

m 

' Equation (37) 

where v Im is the value corresponding to the mth movie watched (or not watched) by customer i; and 

(3) determining the agreement matrix elements ac /? . For each customer i, the jth row of the agreement matrix is 
just the vector v k for the cluster k that the viewer i is in. 



[0162] As an example of this technique, assume the following viewing history, where each V indicates that a 
video program (A-G) was watched by a customer (1-5): inai a 

\program A B c D E F G 

customer\ 

1 v 



x x * x 

2 x x X 

3 v v 



4 
5 
G 



x 

xxx 

X XX 
x x X X 



™l /i ^55 7 4 ^ a s tandard algorithm such as k-means clustering will divide the above people into two 
groups. {1,2,3} and {4,5,6}. The centroid of each cluster is found by calculating the average number of viewers of 
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each movie in the cluster. The centroid of the {1,2,3} group is {2/3,2/3,1,1,1/3,0,0}, corresponding to movies A, B, 
C, D. E F, and G, respectively. The centroid of the {4,5,6} group is {0.0.1/3,1.2/3,1,2/3}. The resulting agreement 
matrix is thus: 

\program 
customer\ 
l 

2 
3 
4 
5 
6 



A 


B 


r 


U 


h 


F 


G 


2/3 


2/3 


1 


1 


1/3 


0 


0 


2/3 


2/3 


1 


1 


1/3 


0 


0 


2/3 
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In a broadcast/cable application of the type described herein, it may be desirable to construct different 
agreement matrices of this type for different times of day or days of the week. 

[0164] This technique can be refined in two ways: (1) by using fuzzy clustering techniques, where a customer 
may belong to different clusters, and (2) by requesting a rating from each customer for each movie viewed In the 
case of fuzzy clustering, each customer gets an agreement matrix which is the sum of the agreement matrices for the 
groups he or she belongs to weighted by the degree to which the viewer belongs to the group. In the case of rating 
requests, on the other hand, the clusters are made not just based on whether the movies were watched but also based 
on how much they were liked as determined from the viewers' ratings of the movie (for example, on a scale of 1 to 10) 
. In the latter case, a different distance metric should be used so that an unrated movie is not confused with a 
20 movie that was viewed and not liked. An appropriate metric is to use the Euclidean distance but to exclude all 
programs not reviewed by the customer A preferred embodiment for kiosks or personal computers (described below) 
incorporates both of these refinements. 

[0165] One skilled in the art will recognize that many additional variations on this -technique are possible 
within the scope of the invention. For example, instead of a standard Euclidean distance metric one may wish to use 
the inverse of the fraction of movies which were watched by both members of the pair. As another alternative 
25 agreements can be normalized by the number of movies or shows the customer has seen Also customers who do not 
want to watch movies repeatedly may block the viewing of recently viewed movies to avoid repeat viewing. 

VI. Hardware Implementation of Profile System 



30 [0166] Two hardware embodiments of the invention may be used to implement the system described above In a 
so-called one-way data transmission system, no feedback from the set top multimedia terminal is provided for 
adjusting the customer profiles or the content profiles. In a two-way data transmission system, on the other hand 
passive feedback techniques are used to better personalize the video offerinas over time. Both hardware embodiments 
will be described below. " 



35 A. One-Way Data Transmission System 

[0167] In a one-way data transmission system in accordance with the invention, a customer profile system in 
accordance with the invention calculates the agreement matrix at the customer's set top multimedia terminal from the 
customer profiles stored in the set top multimedia terminal and the content profiles of the received video 
4Q programming. This technique allows the set top multimedia terminal to create "virtual channels" of the video 
programming receded which the set top multimedia terminal deems most desirable on the basis of the customer's 
profile(s). 

t0168J a „ \ h9 f ' rSt embodiment tnus does not use ar| y of the feedback and updating techniques described above. 
Figure 4 illustrates a generalized diagram of such a one-way video distribution system in accordance with the first 
hardware implementation of the invention. As illustrated in Figure 4, a plurality of program source materials 402 

45 are modulated by a plurality of channel modulators 404 and distributed via distribution system 406 at head end 408 
and via respective nodes 410 to set top multimedia terminals 412 in the homes of the head end's customers In this 
embodiment of the invention, the set top multimedia terminals 412 and/or the distribution system 406 include 
software such as that described above for determining an agreement matrix for each customer. The agreement matrix 
suggests programming for "virtual channels" and/or controls the tuners of the set top multimedia terminals 412 to 
select the most desired programming for the customers in accordance with the customer's profiles In other words a 

w plurality of "virtual" channels are created from the agreement matrix, and the selected programming is provided from 
each set top multimedia terminal 412 to the associated television. The customer then decides whether he or she wants 
to watch one of the "virtual" channels or one of the conventional channels. 

[0169] In the embodiment illustrated in Figure 4, the set top multimedia terminals 412 sit on top of the 
television and receive as input the shows being broadcast and their associated content profiles (either in the bit 
55 stream, the vertical blanking interval, or separately as part of the electronic program guide information) The set 
top multimedia terminals 412 have the customer profiles for that residence prestored therein Set top multimedia 



terminal 412 may also include means for monitoring which shows are being -watched by the customer. From this 
information, the customer profiles stored in the set top multimedia terminal 412 may be modified by the software of 
the set top multimedia terminal 412 using the techniques described in Section II.B. above. In other words, each set 
top multimedia terminal 412 preferably includes means for updating the customer profiles based on what the customer 
actually watched. However, the set top multimedia terminals 412 do not provide the list of the watched programs back 
to the head end for adjusting the video programming schedule since a two-way data transmission system would be 
required. 

B. Two-Way Implementation 

[01 70] The second embodiment of the invention incorporates the above-mentioned passive feedback techniques to 
provide information from the set top multimedia terminals back to the head end so that the video programming 
schedule may be adjusted and so that targeted advertising and the like may be provided from the head end. This 
embodiment differs from the first embodiment in that data regarding the customer's selections of programming is 
collected by the head end for use in future program scheduling. Data collection in accordance with the invention is 
the process by which the customer viewing results and/or profiles are collected by the CATV and/or conventional 
^ broadcast system for subsequent processing and assimilation. In the two-way implementation, the customer profile 
system is implemented at the video head end by creating an agreement matrix for all customers from customer profiles 
stored at the head end and content profiles of the video programming to be transmitted. This technique allows the 
video head end operator to objectively determine which video programming is most likely to be desired by his or her 
customers and also allows one to minimize the memory requirements at the set top multimedia terminal. 
[0171] Memory requirements at the set top terminal are minimized by limiting, at the head end, the amount of 
data which is transmitted to the set top terminal for storage in accordance with the profiles of the user. In other 
words, data which does not match the profile of a particular customer is not transmitted and stored at the set top 
terminal. For example, electronic program guide (EPG) data which is normally transmitted to the user for all 
channels is limited at the head end to only that EPG data which matches the profile of the particular customer. In 
one 'example, the portion of the EPG listing those programs which are not likely to be of interest to the customer 
are not downloaded at all. In another example, the titles for all EPG programs are transmitted, but the program 
descriptions are downloaded only for those programs likely to be of interest to the customer. Thus, the agreement 
matrix acts as a filter for filtering the data which is downloaded to the set top terminal for storage. In this 
manner,- the hardware demands for memory at the set top terminal are minimized, without sacrificing any material of 
interest to the customer. 

[0172] Two main hardware implementations for data collection are described herein with reference to the 
preferred two-way embodiment: telephone system return and CATV system return. Both of these approaches utilize a 
"wired" return path for data collection. In addition, those skilled in the art will appreciate that several wireless 
alternatives for data collection are possible. The specific implementation selected depends upon several variables, 
including the technology in- place on the CATV or conventional over air broadcast system, specific polling techniques 
employed, telephone system flexibility, the required/desired frequency 5 for polling the data, and the level of 
maintenance employed on the CATV or conventional over air broadcast system. Details of a telephone system 
implementation are highlighted in Figures 5 and 6. 

[0173] Figure 5 illustrates a generalized diagram of a two-way video distribution system in accordance with the 
invention. In this embodiment, the customer profile information and viewing habit information from the individual 
set top multimedia terminals is relayed to the head end 502 on a periodic basis for updating the agreement matrices 
on a system level to determine what video programs should be transmitted in particular time slots. As in the one-way 
embodiment of Figure 4, program source material 402 is modulated onto respective channels by modulators 404 for 
distribution to the customers. However, in the two-way embodiment of Figure 5, the head end 502 includes a 
distribution system 504 which is controlled by system controller 506 to schedule the presentation of the program 
source material 402 to the customers in response to passive feedback data stored in data collection memory 508 which 
has been received from the customers' set top multimedia terminals 412. In particular, the customer profile data and 
viewing habit data is collected and periodically provided via return path 510 to data collection memory 508 as a 
record of what the customers desire to watch and what they actually watched. 

[0174] In accordance with the techniques described in detail above, this information is then used to 
appropriately update the system profiles (composite of all customer profiles) and/or the content profiles of video 
programs and thus, in turn, is used in adjusting the scheduling of the program source material 402 for transmission 
via nodes 410 to the respective set top multimedia terminals 412 in the customers' homes. As in the one-way 
embodiment of Figure 4, each set top multimedia terminal 412 then determines "virtual" channels for presentation to 
the customers' televisions. As noted above, return path 510 preferably constitutes a telephone connection, although 
the return path 51 0 could also be a portion of the broad band cable connection. 

[0175] Figure 6 illustrates an actual cable television distribution system" for a cable television 

implementation of the present invention. As illustrated, a variety of modulated program sources 602 are provided 
The programs are selectively (and dynamically) provided to each node via a dynamic program matrix switch 604 at the 
cable head end. Also at the head end is a cable television (CATV) system controller 606 which designates which 
programs are to be delivered to each node. The video signal from the switch 604 is amplified by amplifiers 608 and 
then transmitted over conventional optical fiber and/or coaxial cables 610 to splitters 612 and repeater amplifiers 
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sending the long key, a single encrypted initializer is sent. The system is unbreakable to the extent. that the 
public key system (RSA) is unbreakable, but the computational simplicity of the one-time session key allows it to be 
implementable in hardware for very fast encryption and decryption at the head end and at the set top multimedia 
terminal. 

[0183] Thus, as shown in Figure 7, upstream encryption for a secure transmission path for transmitting 
preference data, profile data and the like from the set top multimedia terminal to the head end is performed as follows: 

(1) At the head end, generate a seed random number N to be used for the random number generator (step 702). 

(2) Retrieve the public key P from the set top multimedia terminal (step 704) and encrypt the seed random number 
N as E(N,P) at the head end using a public key algorithm such as RSA which is known to be difficult to break 
(step 706). 

(3) Send the encrypted seed N (E(N,P)) to the set top multimedia terminal (step 708) where E(N,P) is received 
(step 71 0) and decrypted to yield N using the set top multimedia terminal's private key Q (step 71 2). 

(4) The head end and set top multimedia terminals then initialize their respective pseudo-random number 
generators with N as a seed (step 714). 

(5) Begin the encryption at the set top multimedia terminal (step 716) by having the encryptor generate the 
first number in the sequence K, and logically exclusive-ORing it with the first data word in the stream P. thereby 
forming C, (i.e., C, = EOR (K,,P,)). 

20 (6)'Send the result C, from the encryptor at the set top multimedia terminal to the head end (step 718), where it 

is received by the head end (step 720). 

(7) Form K, at the synchronized random number generator of the head end, which has also been initialized with N, 
by decrypting the received C, to yield P v This is done by exclusive-ORing K, with C, to yield P, (i.e., P, = EOR(K,, C,))' 
25 (step 722), generating the next pseudo-random K, in the sequence at the head end and the set top multimedia 

terminal (step 724), determining whether all words i in the sequence have been decrypted (seep 726) and 
repeating steps 716-726 until all words in the digital video stream have been decrypted. Normal processing of 
the digital video stream continues from that point (step 728). 

30 [0184] " As illustrated in Figure 8, for encryption of the video programming data transmitted from the head end 
to the set top multimedia terminals, the procedure is identical to steps (1)-(7) above illustrated in Figure 7 
except that the roles of the head end and set top multimedia terminal are reversed. 

[0185] v Advantages of such an encryption/decryption technique include the fact ihat the operations for 
encryption and decryption include only an exclusive-OR, which is a one gate delay logical operation Also many 
35 random number algorithms may be implemented which execute rapidly in hardware shrft/divide/accumulate registers 
Accordingly, it is desirable to use such an encryption/decryption technique to maintain the security of the two-wav 
data transmission system described in this section. 

C. Set Top Multimedia Terminal Embodiments 

40 [0186] Figure 9 illustrates a software block diagram of an embodiment of a multimedia terminal 620 for use in 
the one-way and two-way system embodiments described above. As illustrated, the video program material and the 
associated content profiles are received at the set top multimedia terminal 620 from the head end 408 A program 
list indicating those video programs which the user of that set top multimedia terminal 412 has available and is 
authorized to receive is stored in memory 902. The associated content profiles (program characteristic lists) is 

45 preferably received with the electronic program guide data and stored in memory 904. From the content profiles 
stored in memory 904, processor 906 calculates and updates the agreement matrix using the techniques described in 
detail above and stores the resulting agreement matrix in memory 908. As noted above, the customer profiles used in 
calculating the agreement matrix preferably differ in accordance with the time of the day and of the week and/or the 
expected mood of the customer. Accordingly, a record of the time of day is stored in memory 910 as received 
separately from the CATV head end or as input by the customer and maintained locally at the set top multimedia 

50 terminal 620. Similarly, the expected mood of the customer is stored in memory 910. As desired, the expected mood 
may be accessed and modified by the customer. 

[0187] From the agreement matrix determined by processor 906 and stored in memory 908, a list of "preferred 
channel selections" or "virtual channels" is determined. An electronic program or display guide 914 listing the 
available selections is provided. In accordance with the invention, the display guide 914 is either modified to 
include fields for the "virtual" channels, or else the recommended programming is highlighted in an obvious manner 
or reordered for the customer's perusal and selection of the desired programming. Once the customer has selected the 
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also mav be modified for use in an interactive system to anticipate what customers are likely to request so that the 
information may be downloaded in advance using, for example, a simple Markov model and/or probability transiton 
matrices in an event graph. The present inventors contemplate many such embodiments within the scope of the claims 
and will highlight a few such embodiments below. Of course, many other embodiments within the scope of the claims 
will become apparent to those skilled in the art. 



A. Video Distribution Systems 



r0195] As described in detail above, a preferred embodiment of the invention determines an agreement matrix for 
matching customer preferences to available video programming and presenting the most desirable video programs on 
one or more "virtual channels" customized for the customer, thereby minimizing channel surfing . This is 
accomplished by calculating an agreement matrix which matches the characteristics desired by customers with 
corresponding characteristics of the video programs. In one alternate embodiment of the invention described above, 
video programs that tend to be liked by the same people are clustered together or, on the other hand, customers with 
similar interests are clustered together using the agreement matrix. In this manner, the system of the invention is 
used to determine which video programming best meets the needs of a designated viewership. 

r0196] It has also been suggested above that clustering techniques may be used to provide a relatively 
homoqeneous population with targeted advertising. What is significant about the invention in this context is that 
the agreement matrix may be updated based on feedback including actual purchases made by the customer in 
response to such targeted advertising. For example, when shopping at home using infomercials, as when watching a 
movie the products available for purchase can be characterized using different attributes and an agreement matrix 
formed between customer profiles and product profiles. The agreement matrix can also be used to select infomercials 
or other advertisements that the customer is most likely to watch and to respond to by making purchases. If purchase 
information is available, the customer profiles can be updated using the same algorithm described above with respect 
to video programs, but now the updating is based on what the customer actually purchased as well as what 
infomercials he or she watched. 

r0197l The clustering method of the invention may also be modified to include sociodemographic profiles of 
customers Such information may include ages, gender, and race, as well as other information provided by the 
customers themselves. On the other hand, the clustering data may include census data such as zip code data. For 
example as noted above, a zip code may be used as one watf to categorize the customer profiles of the customers 
whereby a new customer to a system would get one or more of a number of generic customer profiles for a particular 
ziD code as his or her initial customer profile. The initial customer profile would then be modified as that customers 
viewinq habits are established. As noted above, such modifications may be accomplished using psychographic data, 
customer preference profiles input directly by the customer, past movie selections, rave reviews, passive feedback 
based on actual television viewing by that customer, records of customer purchases, and the like. 
r0198] It will also be appreciated that the one-way and two-way systems may coexist in a hybrid system. In such 
a hybrid system, the feedback paths from the two-way set top multimedia terminals could be used by test audiences to 
provide initial content profiles for new movies before the movies are made available to all. Similarly the feedback 
oaths from the two-way set top multimedia terminals may be used to provide initial content profiles for subsequent 
episodes of television series. By using this approach, no experts or studio test groups would be needed to establish 
the initial content profiles for new video programming. Also, content profiles would be available for all video 
programming other than that provided solely to the test audiences. 

r0199] Generally the two-way set top multimedia terminals would belong to customers connected to the same 
nodes as other customers having one-way set top multimedia terminals. As a result, the content profiles determined 
from the test programming and the like may also be used to provide initial customer profiles specific to a new 
customer to that node Such a technique may also be used to monitor changing preferences and even changes in 
demographics for the customers connected to each node by periodically updating the clustered customer profiles for 
that node to reflect the changes in the customer profiles of those customers connected to a particular node. 
f02001 Those skilled in the art will also appreciate that the invention may be used in the context of a "Home 
Video Club" of the type described by Herz et al. in U.S. Patent No. 5,351,075 to schedule desired programming. In 
addition, the invention may be used as a navigational aid to heip customers determine what they want to wale, as 
well as to target a set of movie previews for particular customers to examine. 

r0201] Those skilled in the art will also appreciate that the basic agreement matrix described above can be 
generalized to include various weightings such as national popularity, customer requests for movies customer 
requests for times, data on viewership by category and time, and the like. The present invention is also flexible 
enouqh to allow the scheduler to keep regular shows at regular times to draw customers while giving the customers 
the options to select the "best" of what is available on the other channels. In such a scenario,, one could mix 
network television with special cable programming as well as video on demand. Of course, each customer could also 
have one or more of his/her own "customized" virtual channels showing his or her own requests. Similarly each 
customer could adopt the customer profiles of other individuals or programs such as "celebrity' profiles including . 
the viewing preferences of different celebrities. However, such "celebrity" profiles must not be updated through 
passive feedback as described herein and should remain unchanged. 

r02021 Also since there is usually more than one television viewer in a household, it may be desirable to keep 
multiple clusters' of preferences for one television. Those skilled in the art will appreciate that this may be 



handled in a manner similar to the different moods described above. For example, the customer profiles of two or 

^ftf^T 5 T b ,! C ° mb L ned ' WKh SqUal ° r UnequaJ wei 9 htin 9 s - 30 that ^e video prog^ammS St cSen 
profiles strictly within the overlap area of the combined customer profiles will be preferred In this mamer 

[0203] Also the techniques described above may be used to create a virtual channel for video previews wherebv 
prevews of movies and the like available in an on-demand system, for example, may be presented to Tstomers n a 
personalized manner. This may be accomplished even in hotels and the like by providing individuals with perS£d 
ID cards wh.ch store their profiles and card readers at the set top multimedia Terminals which in the 

customer's profiles from the ID cards for local recreation of the customer's agreement maS if desirei he 
updated customer profiles maybe stored back to the ID card at the end of the customer's teleXio^ viewing 

B. Video, Music and Bookstore Kiosks 

[0204] The methods of the invention also may be implemented in a kiosk or personal computer as illustrated in 
.n h ?: Th 8 D- 3 , Vlde0 ' mUS ' C , and/0f b °° k St0re t0 he 'P CUstomers decWe ^ich videos to rent o muf and 

5 237 157 to^Lf 5 ° r P M SOr ! a L COmpUter W ° Uld be Similar in structure t0 the kiosk discl ^ed in U.S Sent No 
5,237 157 to Kaplan and would include a microprocessor 1 102. However, a kiosk or personal computer implemented °n 
accordance with the invention also accepts identity information from the customers either via TyboaSl TlM ? r bv 
electronic reading of a membership card by an electronic card reader (not shown) and retrieves customer profiles S 
that customer from memory 1106 for use in forming an agreement matrix as described above Those : S d he art 
will apprecate that unite the broadcast embodiment above, it is necessary in the kiosk embodiment /match the 
customer profiles to .ndMduab by name or user ID rather than time slot. Such values are provided vL keSoa d 
1 1 04 or an electronic card reader so that the customer profiles for that customer may be retrieved ^ 

SfSi^tin!? 6 ^ ™ ndat '° ns ara th f? selected bv microprocessor 1 102 using the same algorithm described above for 
the selection of virtual" channels. Movies which were recently rented by the customer could be determined bv 
checking that customer's rental records and optionally be removed from the list presented to the custom! Sstomer 
profiles also wou d be updated based on the movies selected using the algorithm and optbSy could be alterod to 
include a rating of the movie provided by the customer when he or she returns the video 

™.o ae I H ! .? r0fi ' in9 lsc ^ ve ° f j h ? invention als ° f°rms the basis for a customer to select a movie by 
example, as in a rave review" described above. As described in Section V.B. above, since customers often do not 
have existing profiles, new customers may create an initial customer profile by selecting one or movies which are 
simitor to what he or she is looking for so that the profiles of these sample movies may be looked Tand averaged 

Lff h^fr P T?k 6 - T k ,S C , USt ° mer Pr ° file is USed in Nation with a" standard J TvSSSto 
2525, "T p0rtance ° f j he characteristics to generate an agreement matrix indicating how much the Somer 

value for a!) ^Tn^^Z ai3b 1- ^ 3 to -f ( ° r 10 m ° vieS > with the hi S hest agreement (mSmTm 

t ! ' are . then P rese "ted to the customer via video processor 1108 for display on display device 1110 alona 

U B CD St'Mi S° Ve ' T je H Ca ° be / X , ClUded WhiCh the CUSt0mer has racentlv rented^ shown i F?S 
\l ,u I- u ^ P y 1112 maV a,S0 be provided at tne kiosk t0 facilitate the playing of short " c |i P s" of ™ZZ 
with the highest agreement to further assist the customer in his or her final selection 

Eill • .h^ 0 ^ 6 ' in ' erestin 9 aspect of kiosk embodiments in which user IDs are used to select the customer 
f™!,. V T t6m be , USSd t0 fadlitate the selection of vide °s which will appeal to several peoSe 
rZ TT P • r, US,0 f mer m ! y enter the User IDs for ,nose individuals expected to watch a particular movie rentot 
^ru^^h f f ° r 6 t aCh PerS ° n are re, J ieVed and compared t0 the customer profiles of the othersTnfer™ by 
the customer The intersection or averaging of the customer profiles may then be used in determining the aaSement 

TnlZ 30 ''I h6 Wi " SSleCt ' h0Se Vid60S W " h the most ap P eal t0 a » Pe^ns specified by toT customer ° 

[0208] Alternatively, when an agreement matrix is implemented in a music or book kiosk to aid in the selection 
iToh ° r °° S ' thech f acteristics of movies are replaced by the characteristics of music or books Fofmusfc 
otht S m 'f lnC '^ e Standard clas s^ations such as rock, easy listening, classical, count™ or 
?^* a Z*~*~™ S ™% ! S Z eT i°^>? 9 artiSt % d f, Cade or centur V tne music was written, approximate yeTr of 
au^f' %ZhZ7 T ■r c 1 C " a " 3 ' ' 6ngth and lhe like ' wnSe books, such characteristics might include 
author standard classifications such as mystery, fiction, non-fiction, as well as length, date of lK 
pub cation and the like. Characteristics of the music or books would similarly be matched against hose desired bv 
Srablf " 39re9ment WhiCh W ° U ' d *"* [he CUSt ° mer to < hose selectionsmost iSSyta Tbefound 

[0209] Music kiosks and book kiosks could also be used in music and/or book stores to aid in the s-lection of 

Un ' ike f"? d f SCribed in thS Kap ' an ' 15? Patsnt ' however ' the «* k would a oJ 
potenfial purchasers to look up music or book selections by example and would match the customer's preferences to 

the characteristics of the available inventory. The potential buyer could listen to segments of hosf music 
selections or review the summaries and reviews of those books with the highest agreement to the customer profite 
created from the sample music or book selections. i « «« i-usiomer proriie 

[021 0] Also as in the video embodiment described above, the content profiles of certain radio stations mav fa- 
used to assist the customer in selecting a radio station from those available, or alternatively a "Stuar radio 
channel may be created for over air or cable transmission. Of course, the concepts herein descTbed may be us? to 



schedule music videos and to schedule the transmission of music over air or cable transmission systems Feedback 
could also be used to improve the content and customer profiles as described above with respect to video program 
ssisciion. 



C. Data Retrieval Systems 



[0211] Those skilled in the art will appreciate that the method of generating agreement matrices for selecting 
preferred video programming as described herein may be generalized for use in other types of data retrieval systems 
besides video and music. For example, the techniques of the invention may be used for the optimum selection of anv 
chunks of information such as stock market data, print information (e.g., for personalized newspapers) or 
multimedia information which can be downloaded over networks such as the Internet. 

[0212] In the case of retrieving stock market data from a computer network, response times for retrievina 
certain stock market data can be shortened by anticipating which menu selections the customer is likely to use and 
downloading that information in anticipation of its likely use. One particularly useful example of this would be the 
retrieval of information about stocks such as recent trade prices and volumes. Since stocks, like movies can be 
characterized in multiple ways, such as by industry, dividend size, risk, cost, where traded, and the like' profiles 
of stock may be developed in a similar manner to that described above. The stocks also can be characterized bv 
whether they are owned by the customer and by whether they have exhibited unusual recent activity These 
characteristics can be used to create profiles and agreement matrices using the identical techniques described 
above In addition, if a customer exhibits a pattern in their request for information about stocks their requests 
can be anticipated and menus assembled to ease selection of the stocks so as to avoid potentially long searches 
through multiple windows, or the information can be downloaded in advance of the customer's request to reduce 
waiting time. Such anticipation of customer requests for information is particularly useful when the waitinq time may 
be significant, as for multimedia information incorporating graphical or auditory information. It is also valuable 
when large amounts of information can be transferred at lower cost, for example, using lower cost transmissions at 
night in anticipation of requests for information the following day. 

[0213] Similarly, in the case of retrieving text or other print information, a customer may be aided in his or 
her navigation through a tree of possible menu items by having the system anticipate which branches are most likely 
to be followed and downloading information in advance of the information being requested, thereby significantly 
speeding up the system response. Old information which is unlikely to be used can be flushed from memory This 
allows information to be ready at the local machine when it is needed. 

[0214] Also, media cross-correlation is also possible using the techniques of the invention by usinq the 
profile from one media to estimate the customer preference for another media. Such an approach might be useful for 
example to predict that an avid customer of sports programs could also be very interested in obtaininq sports or 
news information or information regarding the purchase of sports memorabilia based on his or her viewinq 
preferences. Likewise listeners of a particular type of music may also be interested in purchasing concert tickets 
for the same or similar types of music. 

[0215] Finally, the techniques of the invention may be used to match a potential purchaser to real estate on 
the market by creating profiles of the characteristic features of a house such as size, location, costs, number of 
bedrooms, style, and the like. The potential purchaser can request his or her "dream home" by giving example houses 
by specifying desired characteristics such as range of prices, or by a combination of the two. The agreement matrix 
would match the customer's profiles to the profiles of the available homes and create an agreement matrix The 
system could also verify that the profiles initially entered by the potential purchasers are accurate by sugges inq 
houses of a somewhat drfferent type than those the customer has requested. A house retrieval system which is 
customer controlled could also be developed using the techniques of the invention. In this example, the data source 
would be the standardized real estate listings. 1,6 
[021 6] Although numerous embodiments of the invention and numerous extensions of the inventive concept hav- 
been described above, those skilled in the art will readily appreciate the many additional modifications are 
possible in the exemplary embodiments without materially departing from the novel teachings and advantages of the 
invention. Accordingly, all such modifications are intended to be included within the scope of this invention as 
defined in the following claims, 85 

Claims 

1 . A method of scheduling customer access to data of a plurality of data objects, comprising the steps of: 

creating (104; 110, 112) at least one customer profile for a customer of said data, said customer profile 
indicating the customers preferences for data having predetermined characteristics; 

creating content profiles for each of said data objects, said content profiles indicating the degree of 
content of said predetermined characteristics in data of each of said data objects; 

relating (106, 304) said at least one customer profile with the content profiles for the data available 
from each data object to the customer .a^particular time; " 



determining (204, 206; 304, 310) a subset of data having content profiles which are determined in said 
relating step to most closely match said at least one customer profile; and 

presenting said subset of data to said customer for selection; 

characterized in that 

said content profiles for each of said data objects contain values over a predetermined range of values 
indicating correspondence of the data objects to said predetermined characteristics; and 



said step (204, 206; 304, 310) of determining a subset of data comprises a comparison to 
indication of the attractiveness of said data objects for each customer. 



2. A method as in claim 1, wherein said relating step (106; 304) comprises the step of determining a distance 
between a customer profile and a content profile in a characteristic space by calculating an agreement scalar 
for common characteristics, ac, between said at least one customer profile, cv, and said content Drofiies cn 
in accordance with the relationship: H * p ' 

^jrl/ll^w^cv^-cp^], 

for i - a particular customer of a number of customers I, j = a particular data object of a number of data 
objects J, and k = a particular characteristic of a data object of a number of data object characteristics K where 
cv ik is greater than or equal to 0 and w ik is customer i's normalized weight of characteristic k. 

3. A method as in claim 1 , comprising the further steps of: 

monitoring (1 1 0) which data objects are actually accessed by said customer, and 

updating (112; 310) each customer profile in accordance with the content profiles of the data objects 
actually accessed by that customer to update that customer's actual preferences for said predetermined 
cnaractenstics. 

4. A method as in claim 1 , comprising the further steps of: 

monitoring (1 1 0) which data objects are actually accessed by said customer; and 

updating (112) said customer profile to reflect the frequency of selection of the data objects by customers 
with customer profiles substantially similar to said customer profile. 

5. A method as in claim 4, wherein said customer profile creating step (1 04, 1 1 0, 1 1 2) comprises the steps of: 

selecting a number of desired groups K into which said customers are divided whereby each customer in a 
group has a customer profile substantially similar to a customer profile of each other customer in said group; 

grouping said customers into K groups so as to minimize: 

Z Z I v i ~ vj 

clusters customers 
k=l to K 1 in v 

where | Vj - v k | is a distance between the vector of characteristics of the data objects accessed by customer i and 
the centroid of group k; and 

determining an agreement matrix ac ijt where for each customer i, a jth row of the agreement matrix is a vector v 
k for a group k in which customer i belongs. 

6. A method as in claim 1, wherein said step (104; 110, 112) of creating at least one customer profile comprises 
the step of including in said at least one customer profile a profile of data previously accessed by said customer. 

7. A method as in claim 1, wherein said (104; 110, 112) step of creating content profiles comprises the step of 
including in said content profiles data reflecting the customer profiles of those customers who have pr-viouslv 
accessed said data from each data object. y 



8. A method as in claim 1, wherein said data includes electronic program guide data, and said presenting step 
comprises the step of downloading, in a memory of a customer's electronic program guide. receiving device, 
program descriptions in said electronic program guide only for those entries in said electronic program guide 
which have content profiles which are determined in said relating step to most closely match said at least one 
customer profile. 

9. A method as in claim 1, wherein said customer is connected to a plurality of data sources (406; 602; 1 12) over a 
data network, and wherein said presenting step comprises the step of downloading said subset of' data to said 
customer from said data source prior to selection by said customer of a menu item identifying said subset of data. 

1 0. A method as in claim 9, comprising the further steps of: 

monitoring (110) a customer's pattern in navigating through said data sources to access data havinq 
characteristics desirable to said customer; and 

creating (112; 310) at least one customer profile for said customer indicating the customer's pattern in 
navigating through said data sources to access said data having characteristics desirable to said customer, 

wherein said downloading step further comprises the step of downloading said data having characteristics 
desirable to said customer from said data sources prior to selection by said customer of menu items used on said 
data network for navigating to said data. 

11. A data transmission system which schedules customer access to data of a plurality of data objects, comprising: 

means (406; 412; 506; 606; 906; 1006; 1102) for generating at least one customer profile for a customer of 
said data, said customer profile indicating the customer's preferences for data having predetermined 
characteristics; 

means for generating content profiles for each data object, said content profiles indicating the degree of 
content of said predetermined characteristics in data of each data object; 

means (406; 412; 506; 606; 906; 1006; 1102) for relating said at least one customer profile with the 
content profiles for the data available from each data object to the customer at a particular time; 

means (406; 412; 506; 606; 906; 1006; 1102) for determining a subset of data having content profiles which 
most closely match said at least one customer profile; and 

means (110) for presenting said subset of data to said customer for selection, characterized in that 

said content profiles for each of said data objects contain values over a predetermined range of values 
indicating correspondence of the data objects to said predetermined characteristics; and 

said means (406; 412; 506; 606; 906; 1006; 1 102) for determining a subset of data comprises means for 
generating a comparison of the attractiveness of said data objects for each customer. 



2. A system as in claim 11, wherein said relating means comprises a processor (406; 412; 506; 606, 906; 1006; 
1102) programmed to determine a distance, between a customer profile and a content profile in a characteristic 
space by calculating an agreement scalar for common characteristics, ac, between said at least one customer 
profile, cv, and said content profiles, cp, in accordance with the relationship: 

ac>//[/ + 2 k w jk \cv ik -cp jk \], 

for i - a particular customer of a number of customers I, J = a particular data object of a. number of data 
objects J, and k = a particular characteristic of a data object of a number of data object characteristics K, where 
cv , k is greater than or equal to 0 and w jk is customer i's normalized weight of characteristic k. 

3, A system as in claim 1 1 , further comprising: 

means (41 2; 620; 1 008) for monitoring which data objects are actually accessed by each customer; and 

means (412; 620; 1008) for updating each customer profile in accordance with the content profiles of the 
data objects actually accessed by that customer to update each customer's actual preferences for said 



predetermined characteristics. 



14. A system as in claim 1 1 , further comprising: 

means (41 2; 620; 1 008) for monitoring which data objects are actually accessed by each customer; and 

means (412; 620; 1008) for updating each customer profile to reflect the frequency of selection of the data 
objects by customers with customer profiles substantially similar to said each customer profile. 



15. A system as in claim 14, wherein said means (412; 620; 1008) for generating at least one customer profile 
comprises processing means which selects a number of desired groups K into which said customers are divided 
whereby each customer in a group has a customer profile substantially similar to the customer profile of each 
other customer in said group, groups said customers into K groups so as to minimize: 

E E |v 4 - v k | 

clusters customers 
k=l to K i in k 

where |v, - v k | is a distance between the vector of characteristics of the data objects accessed by customer i and 
the centroid of group k, and determines an agreement matrix ac ?f where for each customer i, a jth row of the 
agreement matrix is a vector v k for a group k in which customer i belongs. 

16. A system as in claim 11, wherein said means (412; 620; 1008) for generating at least one customer profile 
includes in said at least one customer profile a profile of data previously accessed by said customer. 

17. A system as in claim 11, wherein said means for generating content profiles includes in said content profiles 
data reflecting the customer profiles of those customers who have previously accessed said data from each data 
object. 

18. A system as in claim 11, wherein said data includes electronic program guide data, and said presenting means 
comprises means for downloading, in a memory of a customer's electronic program guide receiving device program 
descriptions of an electronic program guide (914) in said electronic program guide data only for those entries 
in said electronic program guide (914) which have content profiles which are determined by said relating means 
to most closely match said at least one customer profile. 

19. A system as in claim 11, wherein said customer is connected to a plurality of data sources (402; 602; 1112) 
over a data network, and wherein said presenting means includes means for downloading said subset of data to 
said customer prior to selection by said customer of a menu item identifying said subset of data. 

20. A system as in claim 1 9, further comprising: 

means (412, 620, 1008) for monitoring a customer's pattern in navigating through said data sources to 
access data having characteristics desirable to said customer; and 

means (412; 506; 606; 906; 1006; 1102) for creating at least one customer profile for said customer 
indicating the customer's pattern in navigating through said data sources to access said data having 
characteristics desirable to said customer, 

wherein said downloading means downloads said data having characteristics desirable to said customer prior 
to selection by said customer of menu items used on said data network for navigating to said data. 

21. A system as in claim 11, further comprising means (918; 1104) if or identifying a customer and for providing 
customer profiles for said customer to said determining means for determining said subset of data for said customer. 

22. A system as in claim 11, further comprising means for providing different customer profiles to said 
determining means in accordance with the time of day and day of the week. 

23. A system as in claim 11, wherein said presenting means is arranged to present said subset of data to the 
customer as a virtual data channel. 

24. A system as in claim 11, further comprising means for storing an electronic program guide (914) at a 
customer's set top terminal (412; 512; 620), wherein said presenting means is arranged to highlight programs 



within said electronic program guide which correspond to said subset of data. 

25. A system as in claim 24, wherein said electronic program guide storing means is arranged to store program 
descriptions of said electronic program, guide (914) only for those entries in said electronic program guide 
(914) which have content profiles which are determined by said determining means to most closely match said at 
least one customer profile. 

26. A system as in claim 11, further comprising an interface which through which said determining means is 
arranged to provide upstream data messages. 

27. A system as in claim 26, further comprising encrypting means (1 022) for encrypting said upstream data messages. 



Patentanspriiche 

1. Verfahren zum Planen eines Kunden- bzw. Teilnehmerzughffs auf Daten einer Vielzahl von Datenobiekten 
bzw. -gegenstanden, umfassend die Schritte: 

Erzeugen (104; 110, 112) von wenigstens einem Kunden- bzw. Teilnehmerprofil fur einen Kunden dieser 
Daten, wobei das Kundenprofil die Vorlieben des Kunden fur " ~' ^ ~ 

Daten anzeigt, welche vorbestimmte Merkmale aufweisen; 

Erzeugen von Inhalts- bzw. Contentprofilen fur jedes dieser Datenobjekte, wobei die Inhaltsprofile das 
Ausma ft des Inhalts der vorbestimmten Merkmale in Daten von jedem dieser Datenobjekte anzeigen; 

Inbezugsetzen (106; 304) des wenigstens einen Kundenprofils mit den Inhaltsprofilen fur die Daten welche 
von jedem Datenobjekt verfugbar sind, an den Kunden zu einer bestimmten Zeit; 

Bestimmen (204, 206; 304, 310) eines Subsatzes von Daten, welcher Inhaltsprofile aufweist, von welchen in 
dem Bezugnahmeschntt bestimmt wurde, daft sie am nachsten mit dem wenigstens einen KundenDrofii 
ubereinstimmen; und H 

Prasentieren des Subsatzes von Daten an den Kunden fur eine Auswahl; 
dadurch gekennzeichnet, daft: 

die Inhaltsprofile fur jedes der Datenobjekte Werte uber einen vorbestimmten Bereich von Werten enthalten 
welche eine Ubereinstimmung der Datenobjekte mit den vorbestimmten Merkmalen anzeigen; und 

der Schritt (204, 206; 304, 310) eines Bestimmens eines Subsatzes von Daten einen Vergleich umfaftt urn 
eine Anzeige der Attraktrvitat der Datenobjekte fur jeden Kunden zu erzeugen. 



2. Verfahren nach Anspruch 1, worin der Bezugnahmeschritt (106; 304) den Schritt eines Bestimmens eines 
Abstands zwischen einem Kundenprofil und einem Inhaltsprofil in einem charakteristischen Raum durch ein 
Berechnen eines Uberemstimmungsskalars fur gemeinsame Merkmale, ac, zwischen dem wenigstens einen 
Kundenprofil, cv, und den Inhaltsprofilen, cp, in Ubereinstimmung mit der Beziehung: 



ac 



= 1/M + E.w.. lev. - co.. II 

K IK « IK • JK'J 



. '] IS IIS " In * 

fur i = em gewisser Kunde einer Anzahl von Kunden I, j = ein bestimmtes Datenobjekt einer Anzahl von 
Datenobjekten J, und k = ein bestimmtes Merkmal eines Datenobjekts einer Anzahl von Datenobjektmerkmalen K 
umfaftt, worm cv ik grofter als oder gleich 0 ist und w jk ein normalisiertes Gewicht eines Merkmals k eines Kunden i 
ist. 



3. Verfahren nach Anspruch 1 , weiters umfassend die Schritte: 

Uberwachen (110), auf welche Datenobjekte tatsachlich durch den Kunden zugegriffen wird; und 

Aktualisieren (112; 310) von jedem Kundenprofil in Ubereinstimmung mit den Inhaltsprofilen der 
Datenobjekte, auf welche tatsachlich durch den Kunden zugegriffen wird, urn die tatsachlichen bzw aktuell-n 
Vorlieben des Kunden fur die vorbestimmten Merkmale zu aktualisieren. 
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4. Verfahren nach Anspruch 1 , weiters umfassend die Schritte: 
Oberwachen (110), auf welche Datenobjekte tatsachlich durch den Kunden zugegriffen wird; und 

Aktualisieren (112) des Kundenprofils, urn die Frequenz einer Auswahl der Datenobjekte durch Kunden mit 
Kundenprofilen im wesentlichen ahnlich zu dem Kundenprofil zu reflektieren. 

5. Verfahren nach Anspruch 4, worin der Erzeugungsschritt (1 04; 1 1 0, 1 1 2) eines Kundenprofils die Schritte umfaRt: 

Auswahlen einer Anzahl von gewOnschten Gruppen K, in welche die Kunden bzw. Teilnehmer unterteilt 
werden. wobei jeder Kunde in einer Gruppe ein Kundenprofil im wesentlichen ahnlich bzw. gleich zu einem 
Kundenprofil von jedem anderen Kunden in dieser Gruppe aufweist; 

Gruppieren der Kunden in K Gruppen, urn zu minimieren: 

Cluster Kunden 
k=1 bis K i in k 

worin |v, - v k | ein Abstand zwischen dem Vektor von Merkmalen der Datenobjekte, auf welche durch den Kunden i 
zugegriffen wird, und dem Schwerpunkt bzw. Zentrum der Gruppe k ist- und 

Bestimmen einer Ubereinstimmungsmatrix ac ijT wo fur jeden Kunden j eine j-te Reihe der Obereinstimmungsmatrix 
ein Vektor v k fur eine Gruppe k ist, zu welcher der Kunde i gehort. 

6. Verfahren nach Anspruch 1 , worin der Schritt (1 04; 1 1 0, 1 1 2) eines Erzeugens von wenigstens einem Kundenprofil 
dls Sstens 'JflfiSSiX^ Da ' en ' We ' Che V ° rher dUrCh KundSn ZUge9rfen ™* in 

30 7. Verfahren nach Anspruch 1 worin der Schritt (104; 110, 112) eines Erzeugens von Inhaltsprofilen den Schritt 
eines Aufnehmens in die Inhaltsprofile von Daten umfaRt, welche die Kundenprofile dieser Kunden reflektieren 
welche vorher auf die Daten von jedem Datenobjekt zugegriffen haben. "e^eren, 

8. Verfahren nach Anspruch 1, worin die Daten elektronische Programmfuhrungsdaten beinhalten und der 
35 Darstellung^ bzw. Prasentierungsschritt den Schritt eines Herunteriadens in einen Speicher einer eine 

35 eekrische Programmfuhrung aufnehmenden Vorrichtung eines Kunden von Programmbeschreibungen inTer 

SSK? Pr °9 rammfuhm , n 9 nur fur diejenigen Eintrage in der elektronischen Programmfuhrung umfaRt, welche 
Inhaltsprofile aufwe.sen, welche ,n dem Bezugnahmeschritt bestimmt wurden, daB sie am nachsten mit dem 
wenigstens einen Kundenprofil Obereinstimmen. 

40 9. Verfahren nach Anspruch 1, worin der Kunde mit einer Vielzahl von Datenquellen (406- 602' 1112) Qber ein 
Datennetzwerk verbunden wird und worin der Prasentierungsschritt den Schritt eines Herunteriadens des 
Subsatzes von Daten an den Kunden von der Datenquelle vor einer Auswahl durch den Kunden eines 
Menugegenstands umfaBt, welcher den Subsatz von Daten identifiziert bzw. bezeichnet. 

1 0. Verfahren nach Anspruch 9, weiters umfassend die Schritte' 

45 

S™^",' S f in l S MU f St ! 6ineS KUnden bSi 8inem Navi 9 ieren durc h die Datenquellen, urn auf Daten 
zuzugreifen, welche fur den Kunden wunschenswerte Merkmale aufweisen; und 

Erzeugen (112; 310) von wenigstens einem Kundenprofil fur den Kunden, welches das Muster des Kunden 
beim Navig,eren durch die Datenquellen anzeigt, urn auf die Daten zuzugreifen, welche fur den Kunden 
wunschenswerte Merkmale aufweisen; 

wnn^h!L HS H t Bd ^ m W , 8iterS d6n Schri " 8ineS Hsnj nteriadens dieser Daten, welche fur den Kunden 
wunschenswerte Merkmale aufweisen, aus den Datenquellen vor einer Auswahl durch den Kunden von 
Menugegenstanden umfaRt, welche an dem Datennetzwerk fur ein Navigieren zu diesen Daten verwendet werden 

55 

11. Datenubertragungssystem, welches einen Teilnehmer- bzw. Kundenzutritt zu Daten einer Vielzahl von 



Datenobjekten plant, umfassend: 



Mittel (406; 412; 506; 606; 906; 1006; 1102) zum Erzeugen von wenigstens einem Kunden- bzw. 
Teilnehmerprofil fur einen Kunden der Daten, wobei das Kundenprofil die Vorlieben des Kunden fur Daten 
anzeigt, welche vorbestimmte Merkmale aufweisen; 

Mittel zum Erzeugen von In halts- bzw. Contentprofilen fur jedes Datenobjekt, wobei die Inhaltsprofile das 
5 Ausmafi. des Inhalts der vorbestimmte n Merkmale in Daten von jedem Datenobjekt anzeigen; 

Mittel (406; 412; 506; 606; 906; 1006; 1102), urn das wenigstens eine Kundenprofil mit den Inhaltsprofilen 
fur die Daten, die von jedem Datengegenstand erhaltlich sind, mit dem Kunden zu einem gewissen Zeitpunkt in 
Beziehung zu setzen; 

10 

Mittel (406; 412; 506; 606; 906; 1006; 11 02) zum Bestimmen eines Subsatzes von Daten, welche Inhaltsprofile 
aufweisen, welche am nachsten mit dem wenigstens einen Kundenprofil ubereinstimmen; und 

Mittel (110) zum Prasentieren des Subsatzes von Daten an den Kunden fur eine Auswahl, dadurch 
gekennzeichnet, dad: 

15 

die Inhaltsprofile fur jedes der Datenobjekte Werte uber einen vorbestimmten Bereich von Werten 
enthalten, welche eine Ubereinstimmung der Datenobjekte mit den vorbestimmten Merkmalen anzeigen; 
und 

2Q die Mittel (406; 412; 506; 606; 906; 1006; 1102) zum Bestimmen eines Subsatzes von Daten Mittel zum 

" Erzeugen eines Vergleichs der Attraktivitat der Datenobjekte fur jeden Kunden umfassen. 



12. System nach Anspruch 11, worin die Bezugnahmemittel einen Prozessor (406; 412; 506; 606; 906; 1006; 1102) 
25 umfassen, welcher programmiert ist, um einen Abstand zwischen einem Kundenprofil und einem Inhaltsprofil in 

. einem charakteristischen Raum durch ein Berechnen eines Ubereinstimmungsskalars fur gemeinsame Merkmale, 
ac, zwischen dem wenigstens einen Kundenprofil, cv, und den Inhaltsprofilen, cp, in Ubereinstimmung mit der 
Beziehung: 

ac^/11+^wJcv^-cp^ 

30 fur i = ein gewisser Kunde einer Anzahl von Kunden I, j = ein bestimmtes Datenobjekt einer Anzahl von 

Datenobjekten J, und k = ein bestimmtes Merkmal eines Datenobjekts einer Anzah! von Datenobjektmerkmalen K 
zu bestimmen, worin cv ik grofter als oder gleich 0 ist und w jk ein normalisiertes Gewicht eines Merkmals k eines 
Kunden i ist. 

35 13. System nach Anspruch 1 1 , weiters umfassend: 

Mittel (412; 620; 1008) zum Uberwachen, auf welche Datenobjekte tatsachlich durch den Kunden zugegriffen 
wird; und 

Mittel (412; 620; 1008) zum Aktualisieren von jedem Kundenprofil in Ubereinstimmung mit den Inhaltsprofilen 
40 der Datenobjekte, auf welche tatsachlich durch den Kunden zugegriffen wird, um die tatsachlichen bzw. 

aktuelien Vorlieben von jedem Kunden fur die vorbestimmten Merkmale zu aktualisieren. 
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14. . System nach Anspruch 1 1 , weiters umfassend: 

Mittel (412; 620; 1008) zum Uberwachen, auf welche Datenobjekte tatsachlich durch den Kunden zugegriffen 
wird; und . 

Mittel (412; 620; 1008) zum Aktualisieren von jedem Kundenprofil, um die Frequenz einer Auswahl der 
Datenobjekte durch Kunden mit Kundenprofilen im wesentlichen ahnlich zu dem Kundenprofil zu reflektieren. 



15. System nach Anspruch 14, worin die Mittel (412; 620; 1008) zum Erzeugen des wenigstens einen Kundenprofils 
Bearbeitungsmittel umfassen, welche eine Anzahl von gewunschten Gruppen K, in welche die Kunden bzw. 
Teiinehmer unterteilt werden, wobei jeder Kunde in einer Gruppe ein Kundenprofil im wesentlichen ahnlich bzw. 
gleich zu dem Kundenprofil von jedem anderen Kunden in dieser Gruppe auswahlt, die Kunden in K Gruppen 
55 gruppiert, um zu minimieren: 



2 2 |vj-vj 

Cluster Kunden 
k=1 bis K i in k 

^S^dt^ «* we„, a .„ s einem 

den Kunden zugegriffen wurde. ™™enpror» emes Profils von Daten bemhaltet, auf welche vorher durch 

dLT r Sn"etSn E wX vol '"^f^r? t ' nhaI ^^" ^haiten. 
zugegriffen haben. . reflektieren, welche vorher auf die Daten von jedem Datenobjekt 

1 %a S r£rng^ beinhalten und die 

Programming aufnehmenden VoSchtung eines kunden TXoaraZL^'T e,ner e ' ektrische 
Programmfuhrung (914) in den elektronischen P™«Ss Programmbeschreibungen emer elektrischen 
elektronischen Programming r1l^^^SS2^ nUr ? dtejeni9en in der 

Bezugnahmemitte, bestimmt wurdl, L s!e am n Sh^dem^ 

Daten an den Kunden vor einer AusSl duro TaSnSSS^iS ^ H ^ m nte ( riaden des s ^satzes von 
von Daten identifiziert bzw. bezeichnet Menugegenstands umfaBt, welcher den Subsatz 

20. System nach Anspruch 1 9, weiters umfassend: 

sr s i2 das 5o Su 6 sS s ^^s^^r^Tr ei T Kundenprofii * r <*■ ^ 

zuzugreffen, welche fur den Kunden S^eS an2e ^ u ™ auf D*en 

A^ d ^ SnT e ;r hen fr e Merkma,e - 

Navigierenzudiesen Daten verwendet werden he ^ntedaden. welche an dem Datennetzwerk fur ein 

. System nach Anspruch 11, weiters umfassend Mittel C91R- iirvn „, m m ^ ■ 
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Eintrage in der elektronischen Programmfuhrung (914) zu speichern, welche Inhaltsprofile aufweisen, von welchen 
durch die Bestimmungsmittel bestimmt wurde, dafi sie am nachsten mit dem wenigstens einen Kundenprofil 
ubereinstimmen. 

26. System nach Anspruch 11, weiters umfassend ein Interface, durch welches die Bestimmungsmittel ausgebildet 
■ sind, Upstream-Datenbotschaften bzw. stromaufwartige Datenbotschaften zur Verfugung zu stellen. 

27. System nach Anspruch 26, welters umfassend Verschlusselungsmittel (1022) zum Verschlusseln bzw. Codieren 
der Upstream-Datenbotschaften. 



10 Revendications 

1. Procede de programmation de faeces d'utilisateurs a des donnees d'une multiplicity d'objets de donnees, 
comportant les etapes consistant a : 

creer (104 ; 110, 112) au moins un profil d'utilisateur pour un utilisateur desdites donnees, ledit profil 
15 d'utilisateur indiquant les preferences de I'utiiisateur pour des donnees ayant des caracteristiques 

predeterminees ; 

creer des profils de contenu pour chacun desdits objets de donnees, lesdits profils de contenu Indiquant le 
degre de contenu desdites caracteristiques predeterminees dans les donnees de chacun desdits objets de 
donnees ; 

20 

lier (106 ; 304) ledit au moins un profil d'utilisateur avec les profils de contenu pour les donnees 
disponibles a partir de chaque objet de donnees pour I'utilisateur a un moment particulier ; 

determiner (204, 206 ; 304, 310) un sous-ensemble de donnees ayant des profils de contenu qui sont 
25 determines dans ladite etape de liaison pour correspondre au plus pres au dit au moins un profil d'utilisateur ; et 

presenter ledit sous-ensemble de donnees audit utilisateur pour selection ; 

caracterise en ce que : 

30 lesdits Profils de contenu pour chacun desdits objets de donnees contiennent des valeurs sur une plage de 

valeurs predetermined indiquant la correspondance des objets de donnees avec lesdites caracteristiques 
predeterminees ; et 

ladite- etape (204, 206 ; 304, 310) de determination d'un sous-ensemble de donnees comporte une 
comparaison afin de generer une indication de I'attractivite desdits objets de donnees pour chaque utilisateur 
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2. Procede selon la revendication 1, selon lequel ladite etape de liaison (106 ■; 304) comporte I'etape de 
determination d'une distance entre un profil d'utilisateur et un profil de contenu dans .un espace 
caracteristique en calculant une grandeur scalaire d'accord pour des caracteristiques communes, ac, entre ledit au 
40 moins un profil d'utilisateur, cv, et lesdits profils de contenu, cp, selon la relation : ■ ■ 



acy = 1/[1 + £ v/fc |cv ik - cp jk |] 
k 

avec i = un utilisateur particulier d'un nombre d'utilisateurs I, j = un objet de donnees particulier d'un 
nombre d'objets de donnees J, et k = une caracteristique particuliere d'un objet de donnees d'un nombre de 
caracteristiques d'objet de donnees K, ou cv ik est superieur ou egal a 0 et w ik est le poids normalise d'utilisateur 
i de caracteristique k. 

3. Procede selon la revendication 1, comportant les etapes supplementaires consistant a : 
surveiller (1 1 0) a quels objets de donnees accede reellement ledit utilisateur ; et 
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mettre a jour (112 ; 310) chaque profil d'utilisateur en fonction des profils de contenu des objets de 
donnees auxquels accede reellement cet utilisateur afin de metlre a jour des preferences reelles de cet 



utilisaleur pour lesdites caracteristiques predeterminees. 



4. Precede selon la revendication 1 , comportant les etapes supplementaires consistant a : 

surveiller (1 10) a quels objets de donnees accede reellement ledit utilisateur ; et 

mettre a jour (112) ledit profil d'utilisateur afin de refleter la frequence de la selection des objets de 
donnees par des utihsateurs avec des profils d'utilisateur sensiblement similaires audit profil d'utilisateur. 

5. Procede selon la revendication 4, selon lequel ladite etape de creation de profil d'utilisateur comporte les 
©tspss consistsnt 3 . 

selectionner un nombre de groupes souhaites K dans lesquels lesdits utilisateurs sont divises de sorte que 
chaque utriisateur dans un groupe a un profil d'utilisateur sensiblement similaire a un D nS 
d'utilisateur dechaque autre utilisateur dans ledit groupe; P 

grouper lesdits utilisateurs en K groupes afin de reduire au minimum : 



grappes utilisateurs 

k=laK idansk 

°° !'V Vkl diStanCS Snlre ' e V8CteUr deS cara ct6ristiques des objets de donnees consultes par 
I utilisateur i et le centre de gravite du groupe k ; et 

determiner une matrice d'accord aq,, ou pour chaque utilisateur i, une j' me rangee de la matrice d'accord est 
un vecteur v k pour un groupe k auquel appartient I'utilisateur i. 

Procede selon la revendication 1, selon lequel ladite etape (104 ; 1 10, 1 12) de creation d'au moins un profil 
d utilisateur comporte 'etape consistant a inclure dans ledit au moins un profil d'utilisateur un profil de 
donnees precedemment accede par ledit utilisateur. 

7. Procede selon la revendication 1, selon lequel ladite etape (104 ; 110, 112) de creation des profils de contenu 
comprend letape constant a inclure dans lesdits profils de contenu des donnees refletant les profils 
d utihsateur des utilisateurs qui ont precedemment consulte lesdites donnees de chaque objet de donnees 

8. Procede selon la revendication 1, selon lequel lesdites donnees comprennent des donnees de guide de 
programme electronique, et ladite etape de presentation comporte I'etape de telechargement, dans une memoire 

recept,on , de 9 uide de Programme electronique d'utilisateur, de descriptions de programme dans 
ledit guide de programme electromque seulement pour les entrees dans ledit guide de programme electronTque qui 

'a ^^^ZZr *" ' adite «~ dS ^ * de —p-dre au plus JKJ 
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dit au moins un profil d'utilisateur. 

9 ' H P nnn C t d c Sln° n in-T^of ^ 1f Se '° n ^ Uti ' iSaleUr est reli * * une multiplied de sources de 

donnees (40o , 602 ; 112) sur un reseau mformatique, et selon lequel ladite etape de presentation comporte 
letape de telechargement dudit sous-ensemble de donnees pour ledit utilisateur a parti? de SS soura Te 
donnees avant la selection par ledit utilisateur d'un element de menu identifiant ledit sous-ensemble de donnees" 

1 0. Procede selon la revendication 9, comportant les etapes supplementaires consistant a : 

surveiller (110) une configuration d'utilisateur dans la navigation a travers lesdites sources de donn-s 
afin d acceder a des donnees ayant des caracteristiques souhaitables pour ledit utilisateur ; et 

creer (112 ; 310) au moins un profil d'utilisateur pour ledit utilisateur indiquant la configuration 
d utilisateur dans la natation a travers lesdites sources de donnees afin d'acceder aux dites donnees 
ayant des caracteristiques souhaitables pour ledit utilisateur, aonne.s 



ladite etape de telechargement comportant en outre I'etape de telechargement desdites donnees ayant des 
caractenstiques souhaitables pour ledit utilisateur a partir desdites sources de donnees avant la 
selection par ledit utilisateur des elements de menu utilises sur ledit reseau informatique afin de 
naviguerjusqu'auxdites donnees. 

11. Systeme de transmission de donnees qui programme I'acces d'utilisateur a des donnees d'une multiplicite 
d'objets de donnees, comportant : K 

des moyens (406 ; 412 ; 506 ; 606 ; 906 ; 1006 ; 1102) destines a generer au moins un profil d'utilisateur 
pour un utilisateur desdites donnees, ledit profil d'utilisateur indiquant les preferences de I'utilisateur 
pour des donnees ayant des caracteristiques predeterminees ; 

des moyens destines a generer des profits de contenu pour chaque objet de donnees lesdits profils de 
contenu indiquant le degre de contenu desdites caracteristiques predeterminees dans les donnees de chaaue 
objet de donnees ; M 

des moyens (406 ; 412 ; 506 ; 606 ; 906 ; 1006 ; 1102) destines a lier ledit au moins un profil 
d utilisateur avec les profils de contenu pour les donnees disponibles a partir de chaque obi-t pour 
I utilisateur a un moment particulier; J K 

des moyens (406 ; 412 ; 506 ; 606 ; 906 ; 1006 ; 1 102) destines a determiner un sous-ensemble de donnees 
ayant des profils de contenu qui correspondent le plus pres audit au moins un profil d'utilisateur ; et 

des moyens (110) destines a presenter ledit sous-ensemble de donnees audit utilisateur Dour une 
selection, caracterise en ce que : K 

lesdits profils de contenu pour chacun desdits objets de donnees contiennent des valeurs sur une place 
predetermine de valeurs indiquant la correspond ance des objets de donnees avec lesdites 
caracteristiques predeterminees ; et 

lesdits moyens (406 ; 412 ; 506 ; 606 ; 906 ; 1006 ; 1102) destines a determiner un sous-ensemble de 
donnees comportent des moyens destines a generer une comparaison de I'attractivite desdits obiets de 
donnees pour chaque utilisateur. J 



12. Systeme selon la revendication 11, dans lequel lesdits moyens de liaison comportent un processeur (406 1 412 * 
506 ; 605 ; 906 ; 1006 ; 1102) programme afin de determiner une distance entre un profil d'utilisateur et un 
profil de contenu dans un espace caracteristique en calculant une grandeur scalaire d'accord pour des 
caracteristiques communes, ac, entre ledit au moins un profil d'utilisateur, cv, et lesdits profils de contenu 
cp, selon la relation : ' 

acy = + £ Wk lev* - cp jk |] 
k 



avec i = un utilisateur particulier d'un nombre d'utilisateurs I, j = un objet de donnees particulier d'un 
nombre d objets de donnees J, et k = une caracteristique particuliere d'un objet de donnees d'un nombre de 
caractenstiques a'objet ae donnees K r ou cv ik est superieur ou egal a 0 et w ik est le poids normalise d'utilisateur 
i de caracteristique k. 

13. Systeme selon la revendication 1 1 , comportant en outre : 

des moyens (412 ; 620 ; 1008) destines a surveiller a quels objets de donnees accede reellement chaqu- 
utilisateur ; et H 

des moyens (412 ; 620 ; 1008) destines a mettre a jour chaque profil d'utilisateur en fonction des profils 
de contenu des objets de donnees auxquels accede reellement cet utilisateur afin de mettre a jour des 
preferences reelles de chaque utilisateur pour lesdites caractenstiques predeterminees. 



1 4. Systeme selon la revendication 1 1 , comportant en outre : 
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2StateTrt (412 : 620 : 1008) destin6s h surveiller * qiJels objets de donn§es acc§de reellement cha q^e 

des moyens (412 ; 620 ; 1008) destines a mettre a jour chaque profil d'utilisateur afin de refleter la 
frequence de selection des objets de donnees par des utilisateurs avec des profils d'utilisateur 
sensiblement similaires audit profil d'utilisateur. "»=>=««ur 

15. Systeme selon la revendication 14, dans lequel lesdits moyens (412 ; 620 ; 1008) destines a generer au moins 
un profil dut isateur comportent des moyens de traitement qui selectionnent un nombre de groupes souhaites K 
dans lesquels lesdits utilisateurs sont divises de sorte que chaque utilisateur dans un groupe a un profil 

10 d utilisateur sensiblement similaire au profil d'utilisateur de chaque autre utilisateur dans ledit groupe ■ 

regroupent lesdits utilisateurs en K groupes afin de reduire au minimum: ' 

15 grappes utilisateurs 

k=laK idansk 

oil |v, - v k | est une distance entre le vecteur des caracteristiques des objets de donnees consultes par 
2Q I utilisateur i et le centre de gravite du groupe k ; et determined une matrice d'accord a Cij , ou pour chaque 

utilisateur i, une j e ™ rangee de la matrice d'accord est un vecteur v k pour un groupe k auquel appartient ('utilisateur i. 

16. Systeme selon la revendication 11, dans lequel lesdits moyens (412 ; 620 ; 1008) destines a generer au moins 
un profil d utilisateur comprennent dans ledit au moins un profil d'utilisateur un profil de donnees 
precedemment accede par ledit utilisateur. 

25 

17. Systeme selon la revendication 11, dans lequel lesdits moyens destines a generer des profils de contenu 
comprennent dans lesdits profils de contenu des donnees refletant les profils d'utilisateur des utilisateurs qui 
ont precedemment consulte lesdites donnees de chaque objet de donnees. 

30 18. Systeme selon la revendication 11. dans lequel lesdites donnees comprennent des donnees de guide de 
programme electronique, et lesdits moyens de presentation comportent des moyens de telechargement dans un- 
memo.re d un dispositif de reception de guide de programme electronique d'utilisateur, de descriptions de 
programme dun guide de programme electronique (914) dans lesdites donnees de guide de proqramme 
electronique seu ement pour les entrees dans ledit guide de programme electronique (914) qui ont des profils de 

35 profiTdStffeateu? determ,nes par lesdits ™Y ens de liaison afin de corresponds au plus pres au dit au moins un 

19 Systeme selon la revendication 11, dans lequel ledit utilisateur est relie a une multiplicite de sources d» 
donnees (40c » ; 602 ,112) sur un reseau informatique, et dans lequel lesdits moyens de presentation comprennent 
des moyens de telechargement dudit sous-ensemble de donnees pour ledit utilisateur avant la selection par ledit 
utilisateur d un element de menu identifiant ledit sous-ensemble de donnees. 

20. Systeme selon la revendication 1 9, comportant en outre : 

des moyens (412 ; 620 ; 1008) destines a surveiller une configuration H'..»nis»>°ur d=>n S i=. nzuin^n = 
travers lesdites sources de donnees afin d'acceder a des donnees ayant des caracteristiques souhaitables 
45 pour ledit utilisateur ; et 

des moyens (412 ; 506 ,606 ; 906 ; 1006 ; 1102) destines a creer au moins un profil d'utilisateur pour 
ledit utilisateur mdiquant la configuration d'utilisateur dans la navigation a travers lesdites sources de 
donnees afin d acceder aux dites donnees ayant des caracteristiques souhaitables pour ledit utilisateur, 

lesdits moyens de telechargement telechargeant lesdites donnees ayant des caracteristiques souhaitables 
avant la selection par ledit utilisateur d'elements de menu utilises sur ledit reseau informatique afin d« 
naviguerjusqu aux dites donnees. ^ 



50 



55 21 ■ ..f^ me s f l ° n la revendication 11, comportant en outre des moyens (918 ; 1104) destines a identifier un 
utilisateur et destines a delivrer des profils d'utilisateur pour ledit utilisateur aux dits moyens de 



determination destines a determiner ledit sous-ensemble de donnees pour ledit utilisateur. 

22. Systeme selon la revendication 11, comportant en outre des moyens destines a delivrer differents profils 
d'utilisateur auxdits moyens de determination en fonction de I'heure et du jour de la semaine. 

23. Systeme selon la revendication 11, dans lequel lesdits moyens de presentation sont prevus pour presenter ledit 
sous-ensemble de donnees a I'utilisateur sous forme d'un canal virtuel de donnees. 

24. Systeme selon la revendication 11, comportant en outre des moyens destines a stocker un guide de programme 
electronique (914) a un terminal d'utilisateur (4.12 ; 512 ; 620), lesdits moyens de presentation etant prevus 
pour mettre en evidence des programmes dans ledit guide de programme electronique qui correspondent audit 
sous-ensemble de donnees. 

25. Systeme selon la revendication 24, dans lequel lesdits moyens de stockage de guide de programme electronique 
sont prevus pour stocker des descriptions de programme dudit guide de programme electronique (914) seulement 
pour les entrees dans ledit guide de programme electronique (914) qui ont des profils de contenu qui sont 
determines par lesdits moyens de determination pour correspondre au plus pres audit au moins un profil 
d'utilisateur. 

26. Systeme selon la revendication 11, comportant en outre une interface par I'intermediaire de laquelle lesdits 
moyens de determination sont prevus pour delivrer des messages de donnees montants. 

27. Systeme selon la revendication 26, comportant en outre des moyens d'encryptage (1022) destines a chiffrer 
lesdits messages de donnees montants. 
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HEAD END: 

ENCRYPT SEED RANDOM 
NUMBER N AS E(N,P) 
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SET TOP 



710 
712 



T 



SET TOP: 
RECEIVE E(N,P) 



SET TOP: 
DECRYPT E(N,P) 
YIELDING N USING SET 
TOP PRIVATE KEY Q 



HEAD END A 
INITIALIZE F 
NUMBER GEf 
WITH N AS . 


ND SET-TOP: 
'SUEDO RANDOM 
1ERAT0RS 
SEED 


» 


716 
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DOWNSTREAM ENCRYPTION: FROM HEAD END 
TO SET TOP (VIDEO STREAM) 
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SET TOP : 
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